Learn about latency and server location

  • Place your servers near relevant Google Cloud regions and configure DNS to route traffic efficiently to meet latency requirements.

  • Using an HTTP persistent connection is strongly recommended to reduce latency by reusing connections for segment requests.

  • While requests are often sent from the closest region, placing servers near multiple key regions helps receive a more complete set of requests for your target locations.

  • The deadline for sending a segment response is 50 ms, including network time and server processing.

To help meet the latency restrictions of the Real-time Curation service, place your servers near the Google Cloud regions most likely to send segment requests to locations you're targeting. Provision and configure DNS such that traffic sent to your endpoint from a given region is routed to your closest server.

The following table lists Google Cloud regions used in Real-time Curation, their approximate location, and examples of targeted locations that are most likely to be sent to your servers in a SegmentRequest from those regions:

Google Cloud Region Region Location Example target location
us-east1 South Carolina, United States North America (East Coast)
us-west1 Oregon, United States North America (West Coast)
us-central1 Iowa, United States North America (Central)
europe-west1 Belgium Europe
europe-west4 Amsterdam, Netherlands Europe
asia-southeast1 Singapore Asia
asia-east1 Taiwan Asia

Use an HTTP persistent connection to reduce latency

To reduce latency, Google strongly recommends that you configure your Real-time Curation integration to use a persistent connection. After a persistent connection is established, your endpoint will reuse connections rather than create a new one for each incoming segment request.

Server location

While you can expect that most segment requests are sent from the region that is closest to the user's location, Google doesn't guarantee that to always be the case. As you place servers close to a greater number of regions, you can expect to receive a more complete set of segment requests for your targeted locations. Google recommends placing servers near a subset of regions appropriate for your targeted locations. For example, most North American traffic can be received by placing your servers close to the us-east1, us-west1, and us-central1 regions.

The deadline that you must send a segment response by is 50 ms.

This deadline includes both the network time between the region and your server, and the time for your server to build a response. We recommend you have at least a 10 ms buffer for unexpected changes in network latency.

Next Steps