DAP Interoperation Test Design

Internet-Draft	DAP Interoperation Test Design	August 2022
Cook	Expires 4 March 2023	[Page]

Abstract

This document defines a common test interface for implementations of the Distributed Aggregation Protocol for Privacy Preserving Measurement (DAP-PPM) and describes how this test interface can be used to perform interoperation testing between the implementations. Tests are orchestrated with containers, and new test-only APIs are introduced to provision DAP-PPM tasks and initiate processing.¶

4. Interoperation Test API

Each container will have an HTTP server listening on port 8080 for commands from the test runner. All requests MUST use the HTTP method POST. Requests and responses for each endpoint listed below SHALL be encoded JSON objects [RFC8729], with media type application/json. All binary blobs (i.e. task IDs, HPKE configurations, and verification keys) SHALL be encoded as strings with base64url [RFC4648], inside the JSON objects. Certain integer values will be encoded as strings in base 10 instead of as numbers, where noted, if JSON numbers cannot fully represent the range of valid values.¶

Each of these test APIs should return a status code of 200 OK if the command was received, recognized, and parsed successfully, regardless of whether any underlying DAP-PPM request succeeded or failed. The DAP-level success or failure will be included in the test API response body. If a request is made to an endpoint starting with "/internal/test/", but not listed here, a status code of 404 Not Found SHOULD be returned, to simplify the introduction of new test APIs.¶

4.1. Common Structures

In multiple APIs defined below, the test runner will send the name of a VDAF, along with the parameters necessary to fully specify the VDAF. These will be stored in a nested object, with the following attributes (new type values and new keys will be added as new VDAFs are defined).¶

Table 1: VDAF JSON object structure
Key	Value
`type`	One of `"Prio3Aes128Count"`, `"Prio3Aes128Sum"`, or `"Prio3Aes128Histogram"`
`bits` (only present if `type` is `"Prio3Aes128Sum"`)	The bit width of the integers being summed, (as a number) used to parameterize the Prio3Aes128Sum VDAF.
`buckets` (only present if `type` is `"Prio3Aes128Histogram"`)	An array of histogram bucket boundaries, (encoded in base 10 as strings) used to parameterize the Prio3Aes128Histogram VDAF.

4.2. Client

4.2.1. `/internal/test/ready`

The test runner will POST an empty object (i.e. {}) to this endpoint to check if the client container is ready to serve requests. If it is ready, it MUST return a status code of 200 OK.¶

4.2.2. `/internal/test/upload`

Upon receipt of this command, the client container will construct a DAP-PPM report with the given configuration and measurement, and submit it. The client container will send its response to the test runner once report submission has either succeeded or permanently failed.¶

Table 2: Request JSON object structure
Key	Value
`taskId`	A base64url-encoded DAP-PPM `TaskId`.
`leader`	The leader's endpoint URL.
`helper`	The helper's endpoint URL.
`vdaf`	An object, with the layout given in {vdaf-object}. This determines the VDAF to be used when constructing a report.
`measurement`	If the VDAF's `type` is `"Prio3Aes128Count"`: 0 or 1. If the VDAF's `type` is `"Prio3Aes128Sum"`: a string (representing an integer in base 10). If the VDAF's `type` is `"Prio3Aes128Histogram"`: a string (representing an integer in base 10).
`nonceTime` (optional)	If present, this provides a substitute time value that should be used when constructing the report. If not present, the current system time should be used, as per normal. The time is represented as a number, with a value of the number of seconds since the UNIX epoch.
`minBatchDuration`	A number, providing the minimum number of seconds that can be in a batch's interval. The batch interval will always be a multiple of this value.

Table 3: Response JSON object structure
Key	Value
`status`	`"success"` if the report was submitted to the leader successfully, or `"error"` otherwise.
`error` (optional)	An optional error message, to assist in troubleshooting. This will be included in the test runner logs.

4.3. Aggregator (Leader or Helper)

4.3.1. `/internal/test/ready`

The test runner will POST an empty object (i.e. {}) to this endpoint to check if the aggregator container is ready to serve requests. If it is ready, it MUST return a status code of 200 OK.¶

4.3.2. `/internal/test/endpoint_for_task`

Request the base URL for DAP-PPM endpoints for a new task. This API will be invoked immediately before /internal/test/add_task (see Section 4.3.3), to determine the endpoint URLs of the aggregators. If the aggregator uses a common set of DAP-PPM endpoints for all tasks, it could always return the same value, such as the relative URL /. Alternately, implementations may wish to generate new endpoints for each task, derive the endpoint based on the TaskId, etc.¶

The test runner will provide the hostname at which the aggregator is externally reachable. If the aggregator returns a relative URL, the test runner will combine it with the hostname into an absolute URL, assuming that the port is 8080. Otherwise, the aggregator can incorporate the hostname into an absolute URL and return that.¶

Table 4: Request JSON object structure
Key	Value
`taskId`	A base64url-encoded DAP-PPM `TaskId`
`aggregatorId`	0 if this aggregator is the leader, or 1 if this aggregator is the helper.
`hostname`	This aggregator's hostname in the interoperation test environment. This may optionally be used in constructing the endpoint URL as an absolute URL.

Table 5: Response JSON object structure
Key	Value
`status`	`"success"` if the endpoint was successfully selected or set up, or `"error"` otherwise.
`error` (optional)	An optional error message, to assist in troubleshooting. This will be included in the test runner logs.
`endpoint`	A relative or absolute URL, specifying the DAP-PPM aggregator endpoint that should be used for this task. If the test runner receives a relative URL, it will transform it into an absolute URL before performing the next phase of task setup.

4.3.3. `/internal/test/add_task`

The HPKE keypair generated for this task should use the mandatory-to-implement algorithms in section 6 of [DAP-PPM], for broad compatibility.¶

Table 6: Request JSON object structure
Key	Value
`taskId`	A base64url-encoded DAP-PPM `TaskId`.
`leader`	The leader's endpoint URL. The test runner will ensure this is an absolute URL.
`helper`	The helper's endpoint URL. The test runner will ensure this is an absolute URL.
`vdaf`	An object, with the layout given in {vdaf-object}. This determines the task's VDAF.
`leaderAuthenticationToken`	The authentication bearer token that is shared with the other aggregator, as a string. This string must be safe for use as an HTTP header value.
`collectorAuthenticationToken` (only present if `aggregatorId` is 0)	The authentication bearer token that is shared between the leader and collector, as a string. This string must be safe for use as an HTTP header value.
`aggregatorId`	0 if this aggregator is the leader, or 1 if this aggregator is the helper.
`verifyKey`	The verification key shared by the two aggregators, encoded with base64url.
`maxBatchLifetime`	A number, providing the maximum number of times any report can be included in a collect request.
`minBatchSize`	A number, providing the minimum number of reports that must be in a batch for it to be collected.
`minBatchDuration`	A number, providing the minimum number of seconds that can be in a batch's interval. The batch interval will always be a multiple of this value.
`collectorHpkeConfig`	The collector's HPKE configuration, encoded in base64url, for encryption of aggregate shares.

Table 7: Response JSON object structure
Key	Value
`status`	`"success"` if the task was successfully set up, or `"error"` otherwise. (for example, if the VDAF was not supported)
`error` (optional)	An optional error message, to assist in troubleshooting. This will be included in the test runner logs.

4.4. Collector

4.4.1. `/internal/test/ready`

The test runner will POST an empty object (i.e. {}) to this endpoint to check if the collector container is ready to serve requests. If it is ready, it MUST return a status code of 200 OK.¶

4.4.2. `/internal/test/add_task`

Register a task with the collector, with the given configuration. Returns the collector's HPKE configuration for this task.¶

Table 8: Request JSON object structure
Key	Value
`taskId`	A base64url-encoded DAP-PPM `TaskId`.
`leader`	The leader's endpoint URL.
`vdaf`	An object, with the layout given in {vdaf-object}. This determines the task's VDAF.
`collectorAuthenticationToken`	The authentication bearer token that is shared between the leader and collector, as a string. This string must be safe for use as an HTTP header value.

Table 9: Response JSON object structure
Key	Value
`status`	`"success"` if the task was successfully set up, or `"error"` otherwise. (for example, if the VDAF was not supported)
`error` (optional)	An optional error message, to assist in troubleshooting. This will be included in the test runner logs.
`collectorHpkeConfig` (if successful)	The collector's HPKE configuration, encoded in base64url, for encryption of aggregate shares.

4.4.3. `/internal/test/collect_start`

Send a collect request to the leader with the provided parameters, and return a handle to the test runner identifying this collect request. The test runner will provide this handle to the collector in subsequent /internal/test/collect_poll requests (see Section 4.4.4).¶

Table 10: Request JSON object structure
Key	Value
`taskId`	A base64url-encoded DAP-PPM `TaskId`.
`aggParam`	A base64url-encoded aggregation parameter.
`batchIntervalStart`	The start of the batch interval, represented as a number equal to the number of seconds since the UNIX epoch.
`batchIntervalDuration`	The duration of the batch interval in seconds, as a number.

Table 11: Response JSON object structure
Key	Value
`status`	`"success"` if the collect request succeeded, or `"error"` otherwise.
`error` (optional)	An optional error message, to assist in troubleshooting. This will be included in the test runner logs.
`handle` (if successful)	A handle produced by the collector to refer to this collect request. This must be a string.

4.4.4. `/internal/test/collect_poll`

Upon receiving this command, the collector will poll the leader's collect URL for the collect job associated with the provided handle, and provide the status and result to the test runner.¶

Table 12: Request JSON object structure
Key	Value
`handle`	The handle for a collect request from a previous invocation of `/internal/test/collect_start`. (see Section 4.4.3)

Table 13: Response JSON object structure
Key	Value
`status`	Either `"complete"` if the result was returned, `"in progress"` if the result was not yet ready, or `"error"` if an error occurred.
`error` (optional)	An optional error message, to assist in troubleshooting. This will be included in the test runner logs.
`result` (if complete)	The result of the aggregation. If the VDAF is of type Prio3Aes128Count, this will be a number. If the VDAF is of type Prio3Aes128Sum, this will be a string, representing an integer in base 10. If the VDAF is of type Prio3Aes128Histogram, this will be an array of numbers.

4.4.5. Heavy Hitters

Once Poplar1 reaches a future draft of [DAP-PPM], additional test APIs for collector containers should be introduced to perform an entire Heavy Hitters computation on a given Poplar1 task and collection interval, encompassing multiple collect flows automatically initiated by the collector.¶

4.5. Test Cases

Test cases could be written to cover the following scenarios.¶

Test successful aggregations with each VDAF.¶
Test an aggregation over a few hundred or thousand reports, to exercise the aggregators' division of reports into aggregation jobs.¶
Test that uploading a report with a time far in the future is rejected.¶
Confirm that leaders and helpers reject requests with respective authentication tokens that are incorrect.¶
Test enforcement of max_batch_lifetime by making overlapping collect requests.¶
Perform an entire aggregation and collect flow, attempt to upload a late report that falls into the same collect interval, and test that performing the collect request a second time yields the same result.¶
Attempt to upload a canned report from the test runner more than once, and confirm that anti-replay measures were effective by inspecting the aggregation result.¶

4.6. Other Test Considerations

All test cases should automatically fail after a generous timeout.¶

It is the responsibility of the test runner to wait for all containers to start up and respond successfully to a request to /internal/test/ready before sending any further commands.¶

Aggregator URLs will be constructed by the test runner with hostnames that resolve to the respective containers within the container network.¶

Once a future [DAP-PPM] draft solves the issue of retries in the aggregate flow, a reverse proxy could be introduced in front of each aggregator to inject failures when sending requests or responses, to test the protocol's resilience. (It is known such a test would fail based on the current protocol.)¶

4.7. Test Runner Operation

The following sequence outlines how the test runner will use the above APIs on port 8080 of each container to perform a typical integration test, executing a successful aggregation.¶

Create and start containers.¶
Set up networking between containers.¶
Try sending /internal/test/ready requests to each container, and retry until they succeed.¶
Generate a random TaskId, random authentication tokens, and a VDAF verification key.¶
Send a /internal/test/endpoint_for_task request (Section 4.3.2) to the leader.¶
Send a /internal/test/endpoint_for_task request to the helper.¶
Construct aggregator URLs using the above responses.¶
Send a /internal/test/add_task request (Section 4.4.2) to the collector. (the collector generates an HPKE key pair as a side-effect)¶
Send a /internal/test/add_task request (Section 4.3.3) to the leader.¶
Send a /internal/test/add_task request (Section 4.3.3) to the helper.¶
Send one or more /internal/test/upload requests (Section 4.2.2) to the client.¶
Send a /internal/test/collect_start request (Section 4.4.3) to the collector. (this provides a handle for use in the next step)¶
Send /internal/test/collect_poll requests (Section 4.4.4) to the collector, polling until it is completed. (the collector will provide the calculated aggregate result)¶
Stop containers.¶
Copy logs out of each container.¶
Delete containers, and clean up container networking resources.¶

DAP Interoperation Test Design

Abstract

About This Document

Status of This Memo

Copyright Notice

Table of Contents

1. Introduction

2. Conventions and Definitions

3. Container Interface

4. Interoperation Test API

4.1. Common Structures

4.2. Client

4.2.1. `/internal/test/ready`

4.2.2. `/internal/test/upload`

4.3. Aggregator (Leader or Helper)

4.3.1. `/internal/test/ready`

4.3.2. `/internal/test/endpoint_for_task`

4.3.3. `/internal/test/add_task`

4.4. Collector

4.4.1. `/internal/test/ready`

4.4.2. `/internal/test/add_task`

4.4.3. `/internal/test/collect_start`

4.4.4. `/internal/test/collect_poll`

4.4.5. Heavy Hitters

4.5. Test Cases

4.6. Other Test Considerations

4.7. Test Runner Operation

5. Implementation Status

6. Security Considerations

7. IANA Considerations

8. References

8.1. Normative References

8.2. Informative References

Acknowledgments

Author's Address