Open
Description
Every while and then I see that TestQuerierWithBlocksStorageRunningInSingleBinaryMode
integration test is flaky (eg. this CI):
=== RUN TestQuerierWithBlocksStorageRunningInSingleBinaryMode/blocks_sharding_enabled,_ingester_gRPC_streaming_disabled,_inmemory_index_cache
08:05:54 Starting consul
08:05:55 consul: ==> Starting Consul agent...
08:05:55 consul: ==> Consul agent running!
08:05:55 consul: Version: 'v0.9.4'
08:05:55 consul: Node ID: 'a6770ff6-b258-8634-842f-dddfd5547b0a'
08:05:55 consul: Node name: 'consul'
08:05:55 consul: Datacenter: 'dc1' (Segment: '<all>')
08:05:55 consul: Server: true (Bootstrap: false)
08:05:55 consul: Client Addr: 0.0.0.0 (HTTP: 8500, HTTPS: -1, DNS: 8600)
08:05:55 consul: Cluster Addr: 127.0.0.1 (LAN: 8301, WAN: 8302)
08:05:55 consul: Encrypt: Gossip: false, TLS-Outgoing: false, TLS-Incoming: false
08:05:55 consul: ==> Log data will now stream in as it occurs:
08:05:55 Ports for container: e2e-cortex-test-consul Mapping: map[8500:33044]
08:05:55 Starting minio-9000
08:05:56 Ports for container: e2e-cortex-test-minio-9000 Mapping: map[9000:33045]
08:05:56 Starting memcached
08:05:56 minio-9000: Attempting encryption of all config, IAM users and policies on MinIO backend
08:05:58 Ports for container: e2e-cortex-test-memcached Mapping: map[11211:33046]
08:05:58 Starting cortex-1
08:05:59 cortex-1: level=warn ts=2020-07-08T08:05:59.301263543Z caller=experimental.go:19 msg="experimental feature in use" feature="Blocks storage engine"
08:05:59 cortex-1: level=warn ts=2020-07-08T08:05:59.301373647Z caller=experimental.go:19 msg="experimental feature in use" feature="Blocks storage engine"
08:05:59 cortex-1: level=warn ts=2020-07-08T08:05:59.303325568Z caller=modules.go:190 msg="Worker address is empty in single binary mode. Attempting automatic worker configuration. If queries are unresponsive consider configuring the worker explicitly." address=127.0.0.1:9095
08:05:59 Ports for container: e2e-cortex-test-cortex-1 Mapping: map[80:33048 9095:33047]
08:05:59 Starting cortex-2
08:06:00 cortex-2: level=warn ts=2020-07-08T08:06:00.385882716Z caller=experimental.go:19 msg="experimental feature in use" feature="Blocks storage engine"
08:06:00 cortex-2: level=warn ts=2020-07-08T08:06:00.386608276Z caller=experimental.go:19 msg="experimental feature in use" feature="Blocks storage engine"
08:06:00 cortex-2: level=warn ts=2020-07-08T08:06:00.404054134Z caller=modules.go:190 msg="Worker address is empty in single binary mode. Attempting automatic worker configuration. If queries are unresponsive consider configuring the worker explicitly." address=127.0.0.1:9095
08:06:00 Ports for container: e2e-cortex-test-cortex-2 Mapping: map[80:33050 9095:33049]
08:06:05 cortex-2: level=warn ts=2020-07-08T08:06:05.442784029Z caller=blocks_scanner.go:406 msg="found partial blocks" user=user-1 blocks=01ECPQBNVW47E4D6ZYR2PENZR3 err="meta.json not found"
08:06:07 cortex-2: level=warn ts=2020-07-08T08:06:07.448660828Z caller=blocks_scanner.go:406 msg="found partial blocks" user=user-1 blocks=01ECPQBQTCMQTW19WXS4TRPBX5 err="meta.json not found"
08:06:08 consul: ==> Newer Consul version available: 1.8.0 (currently running: 0.9.4)
TestQuerierWithBlocksStorageRunningInSingleBinaryMode/blocks_sharding_enabled,_ingester_gRPC_streaming_disabled,_inmemory_index_cache: querier_test.go:369:
Error Trace: querier_test.go:369
Error: Received unexpected error:
unable to find metrics [thanos_store_index_cache_requests_total] with expected values. Last values: [11]
Test: TestQuerierWithBlocksStorageRunningInSingleBinaryMode/blocks_sharding_enabled,_ingester_gRPC_streaming_disabled,_inmemory_index_cache
08:06:33 Killing cortex-2
08:06:33 cortex-2: level=error ts=2020-07-08T08:06:33.583550354Z caller=client.go:215 msg="error getting path" key=collectors/store-gateway err="Get \"http://e2e-cortex-test-consul:8500/v1/kv/collectors/store-gateway?index=30&stale=&wait=10000ms\": context canceled"
08:06:33 cortex-2: level=error ts=2020-07-08T08:06:33.583798103Z caller=worker_frontend_manager.go:96 msg="error processing requests" err="rpc error: code = Canceled desc = context canceled"
08:06:33 cortex-2: level=error ts=2020-07-08T08:06:33.584304101Z caller=client.go:215 msg="error getting path" key=collectors/store-gateway err="Get \"http://e2e-cortex-test-consul:8500/v1/kv/collectors/store-gateway?index=30&stale=&wait=10000ms\": context canceled"
08:06:33 cortex-2: level=error ts=2020-07-08T08:06:33.584473392Z caller=client.go:215 msg="error getting path" key=collectors/ring err="Get \"http://e2e-cortex-test-consul:8500/v1/kv/collectors/ring?index=31&stale=&wait=10000ms\": context canceled"
08:06:33 cortex-2: level=warn ts=2020-07-08T08:06:33.596942606Z caller=transfer.go:431 msg="transfer attempt failed" err="cannot find ingester to transfer blocks to: no pending ingesters" attempt=1 max_retries=10
08:06:34 Killing cortex-1
08:06:34 cortex-1: level=error ts=2020-07-08T08:06:34.101657012Z caller=worker_frontend_manager.go:96 msg="error processing requests" err="rpc error: code = Canceled desc = context canceled"
08:06:34 cortex-1: level=error ts=2020-07-08T08:06:34.101929695Z caller=client.go:215 msg="error getting path" key=collectors/store-gateway err="Get \"http://e2e-cortex-test-consul:8500/v1/kv/collectors/store-gateway?index=34&stale=&wait=10000ms\": context canceled"
08:06:34 cortex-1: level=error ts=2020-07-08T08:06:34.102637776Z caller=client.go:215 msg="error getting path" key=collectors/store-gateway err="Get \"http://e2e-cortex-test-consul:8500/v1/kv/collectors/store-gateway?index=34&stale=&wait=10000ms\": context canceled"
08:06:34 cortex-1: level=error ts=2020-07-08T08:06:34.102801363Z caller=client.go:215 msg="error getting path" key=collectors/ring err="Get \"http://e2e-cortex-test-consul:8500/v1/kv/collectors/ring?index=32&stale=&wait=10000ms\": context canceled"
08:06:34 cortex-1: level=warn ts=2020-07-08T08:06:34.107613863Z caller=transfer.go:431 msg="transfer attempt failed" err="cannot find ingester to transfer blocks to: no pending ingesters" attempt=1 max_retries=10
08:06:34 Killing memcached
08:06:34 memcached: Signal handled: Terminated.
08:06:34 Killing minio-9000
08:06:34 minio-9000: Exiting on signal: TERMINATED
08:06:35 Killing consul