Question

Can I denormalize (by joining) multiple large tables in BigQuery?

Table1 is 400M rows
Table2 is 2M rows
Table3 is 800K rows

If not, do I have to do it in my relational database before I upload? That would be a difficult solution.

Should I chunk the tables into smaller pieces and run iterations of joins over the tables so that it is always large to small? This would also be a difficult solution.

Thank you.

Solution

BigQuery now supports "Big JOINs", which let you skip the LIMITs in your JOIN queries.

Docs here: https://developers.google.com/bigquery/docs/query-reference#joins

Other tips

Yes, you can create new tables from query results.

Take a look here:

https://developers.google.com/bigquery/docs/queries

and here:

https://developers.google.com/bigquery/docs/tables#addmoredata
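
As a rough illustration of writing a join result into a new table, here is a minimal sketch in Python. It assumes BigQuery's current standard SQL (`CREATE TABLE ... AS SELECT`) and the `google-cloud-bigquery` client library, neither of which is mentioned in the thread. The dataset name, the output table name `denormalized`, and the columns `table2_id`, `table3_id`, `id` and `label` are hypothetical placeholders; substitute your own schema.

```python
def build_denormalize_sql(dataset: str) -> str:
    """Build a CREATE TABLE ... AS SELECT statement that joins the three tables.

    All column names here (table2_id, table3_id, id, label) are hypothetical
    placeholders, not taken from the question.
    """
    return (
        f"CREATE TABLE `{dataset}.denormalized` AS\n"
        f"SELECT t1.*, t2.label AS table2_label, t3.label AS table3_label\n"
        f"FROM `{dataset}.Table1` AS t1\n"
        f"JOIN `{dataset}.Table2` AS t2 ON t1.table2_id = t2.id\n"
        f"JOIN `{dataset}.Table3` AS t3 ON t1.table3_id = t3.id"
    )


def run_denormalize(dataset: str) -> None:
    """Run the statement in BigQuery (needs credentials and the client library)."""
    # Imported lazily so the SQL builder above works without the library installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    # result() blocks until the query job has finished.
    client.query(build_denormalize_sql(dataset)).result()
```

Calling `print(build_denormalize_sql("mydataset"))` shows the statement without touching BigQuery, so you can check it (or paste it into the query console) before running `run_denormalize("mydataset")`.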

License: CC-BY-SA with attribution
Not affiliated with StackOverflow