SketchLearn - Relieving User Burdens in Approximate Measurement with Automated Statistical Inference

  • Link2Paper
  • [Author: Qun Huang et al.]
  • [Keywords: Network Measurement, Sketch]

Motivation

Network measurement must meet stringent resource requirements in the face of massive network traffic. While approximate measurement can trade accuracy for resource savings, it demands intensive manual effort to configure the right resource-accuracy trade-offs in real deployments. These user burdens stem from how existing approximate measurement approaches deal with resource conflicts when tracking massive network traffic with limited resources. In particular, they tightly couple resource configurations with accuracy parameters, so as to provision sufficient resources to bound the measurement errors. Such tight binding leads to several practical limitations:

  1. administrators need to specify tolerable error levels as input for resource configurations;
  2. each configuration is tied to specific parameters (e.g., thresholds);
  3. the theoretical analysis only provides worst-case analysis and fails to guide the resource configurations based on actual workloads;
  4. each configuration is fixed for specific flow definitions;
  5. administrators cannot readily quantify the extent of errors for measurement results.

Contribution

Problem

Design Requirement

  1. Small memory usage;
  2. Fast per-packet processing;
  3. Real-time response;
  4. Generality.

Limitations of Existing Approaches

Limitation

  1. Hard to specify expected errors (theoretical guarantees based on configurable parameters);
  2. Performance degrades with different thresholds (e.g., heavy hitter detection);
  3. Hard to tune configurations;
  4. Hard to re-define flowkeys (5-tuple, src-dst addr pairs, …);
  5. Hard to examine correctness.
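As a concrete illustration of limitations 1 and 3: in a classical Count-Min sketch (a generic example, not SketchLearn's own structure), memory is sized directly from a target error pair (ε, δ), so the administrator must specify tolerable errors up front, and the guarantee is only worst-case:

```python
import math

def count_min_dimensions(epsilon: float, delta: float):
    """Width/depth guaranteeing error <= epsilon * total_count
    with probability >= 1 - delta (worst-case bound)."""
    width = math.ceil(math.e / epsilon)      # columns per row
    depth = math.ceil(math.log(1 / delta))   # number of rows
    return width, depth

# Tightening the target error 10x inflates memory 10x,
# even if the actual workload would need far less.
print(count_min_dimensions(0.01, 0.01))   # (272, 5)
print(count_min_dimensions(0.001, 0.01))  # (2719, 5)
```

The configuration is derived from the error target, not from the workload, which is exactly the coupling the paper argues against.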

Solution

  1. Multi-level Sketch (makes it possible to quantify hash collisions and bit-level flowkey distributions);
  2. Model Inference (their assumptions require separating large and small flows).

Multi-level sketch

Model Inference

Large flow extraction
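A minimal Python sketch of the multi-level idea (simplified to one row per level with tiny, illustrative dimensions; the hash function and counter layout are assumptions for the demo): level 0 counts every packet, and level k counts only packets whose k-th flowkey bit is 1, so the per-stack ratios V[k]/V[0] expose bit-level distributions. A large flow pushes the ratios in its stack toward its own bit values, which is what lets extraction recover its flowkey bit by bit:

```python
import hashlib

L = 8       # flowkey bit width (illustrative; real keys are far wider)
COLS = 4    # columns per level (tiny for demo)

# levels 0..L, each a row of COLS counters
sketch = [[0] * COLS for _ in range(L + 1)]

def col(key: int) -> int:
    # single hash shared across levels (one row per level in this demo)
    return int(hashlib.md5(key.to_bytes(2, "big")).hexdigest(), 16) % COLS

def update(key: int) -> None:
    c = col(key)
    sketch[0][c] += 1              # level 0: all packets
    for k in range(L):
        if (key >> k) & 1:
            sketch[k + 1][c] += 1  # level k+1: packets with bit k set

# feed one large flow plus a little background noise
for _ in range(100):
    update(0b10110010)
for key in (3, 7, 12):
    update(key)

# in the large flow's stack, V[k]/V[0] is near 1 for its 1-bits
# and near 0 for its 0-bits, so its flowkey can be inferred
c = col(0b10110010)
ratios = [sketch[k + 1][c] / sketch[0][c] for k in range(L)]
guessed = sum(1 << k for k, r in enumerate(ratios) if r > 0.5)
print(bin(guessed))  # 0b10110010
```

Small colliding flows only nudge the ratios slightly, which is why the paper can model the no-large-flow case statistically and treat large deviations as evidence of a large flow.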

Evaluation

Testbed

OVS and P4. A simulator is used for larger-scale tests (it eliminates network transfer overhead).

Traces

CAIDA [1] and UN2 [2].

Experiments

  1. Memory usage
  2. Per-packet processing
  3. Testbed throughput
  4. Simulator throughput
  5. Inference time
  6. Generality
  7. Fitting theorems
  8. Robust to small thresholds
  9. Arbitrary field combinations
  10. Attaching error measures
  11. Network-wide coordination

Source code

Link to github repo. Respect! :)

Limitation

  • Default parameters are limited
  • Does the model inference algorithm converge, especially with a larger number of rows?

Ref