BigQuery's query is extremely slow -


i have table 1.6 billion rows. have been running query uses group-by field has on 5 million unique values , sort sum of integer value in descending order , return top 10. notice after more hour, query still stuck in running state.

i have created big table using "bq cp -a ". source tables "bq cp" 1000 smaller tables , each table loaded on 12 compressed csv load files.

i have searched related question , found "google bigquery running queries slowly" mention slowness caused fragmentation lot of small ingestion. approach of data infestion consider "too small data bit" during ingestion caused fragmentation?

is possible 5 million unique values , root cause of slow response?

we've had latency spike yesterday, , smaller 1 today. can give project id + job ids of query jobs took longer expected?


Comments

Popular posts from this blog

python - How to create a legend for 3D bar in matplotlib? -

java - Multi-Label Document Classification -

php - Dynamic url re-writing using htaccess -