I have logs which I am trying to push to Google BigQuery. I am trying to build the entire pipeline using google dataflow. The log structure is different and can be classified into four different type. In my pipeline I read logs from PubSub parse it and write to BigQuery table. The table to which the logs need to written is depending on one parameter in logs. The problem is I am stuck on a point where how to change TableName for BigQueryIO.Write at runtime.
Google dataflow write to mutiple tables based on input
1.5k views Asked by nikhil sharma At
1
There are 1 answers
Related Questions in GOOGLE-BIGQUERY
- SQL LAG() function returning 0 for every row despite available previous rows
- Convert C# DateTime.Ticks to Bigquery DateTime Format
- SELECT AS STRUCT/VALUES
- Google Datastream errors on larger MySQL tables
- Can i add new label called looker-context-look_id in BigQuery connection(Looker)
- BigQuery external table using JSON files
- Does Apache Beam's BigQuery IO Support JSON Datatype Fields for Streaming Inserts?
- sample query for review for improvement on big query
- How does Big Query differentiate between a day and month when we upload any CSV or text file?
- How to get max value of a column when ids are unique but they are related through different variables
- how to do a filter from a table where 2 different columns has 2 different records which has same set of key combinations in bigquery?
- How to return a string that has a special character - BigQuery
- How do I merge multiple tables into a new table in BigQuery?
- Customer Churn Calculation
- Is it correct to add "UNNEST" in the "ON" condition of a (left) join?
Related Questions in GOOGLE-CLOUD-DATAFLOW
- Can anyone explain the output of apache-beam streaming pipeline with Fixed Window of 60 seconds?
- How to stream data from Pub/Sub to Google BigTable using DataFlow?
- Google Cloud Dataflow data sampling issue
- how can i get a sense of the cost of my dataflow prime job?
- Google Cloud Dataflow Workbench instance is created via Terraform but notebook is not up
- BigQuery Storage WRITE API: Concurrent connections per project for small regions per region
- Programatically deploying and running beam pipelines on GCP Dataflow
- NameError: name 'beam' is not defined while running 'Create beam Row-ptransform
- How to pre-build worker container Dataflow? [Insights "SDK worker container image pre-building: can be enabled"]
- Writing to bigquery using apache beam throws error in between
- Generate data flow graph for ETL process
- Sample File is part of validating queries/ Power BI adds steps that ruin dataflow
- Airlfow DAG DataflowTemplatedJobStartOperator with Google Provided Template GCS_Text_to_Cloud_PubSub
- How to fetch distinct dates from a CSV file and iterate a query for deletion on Azure DataFactory Pipeline
- GCP PubSub to DLP Integration via Dataflow
Related Questions in GOOGLE-CLOUD-PUBSUB
- Does Apache Beam's BigQuery IO Support JSON Datatype Fields for Streaming Inserts?
- How to stream data from Pub/Sub to Google BigTable using DataFlow?
- App didn't recieved a gcp pubsub message for a minute
- GCP Pub Sub topics
- Unable initialise pub/sub with SparkSession
- Unexpected Redelivery of Messages in Google Cloud Pub/Sub with Cloud Run despite Successful Acknowledgment
- GCP PubSub to DLP Integration via Dataflow
- How can I export Pub/Sub messages using a Protobuf schema to a GCS bucket?
- Can I Trigger a Cloud Function Based on a Pub/Sub Subscription?
- Unable to migrate to spring 3.2.3. possible Issue with messagingGateway
- Flink Job consuming Google PubSub - DEADLINE_EXCEEDED exception
- KeyError in Apache Beam while reading from pubSub,'ref_PCollection_PCollection_6'
- How to create a Pub/Sub topic and send a message to its triggering Pub/Sub topic?
- Google Cloud Function Connection Error when Deployed but Works in Inline Editor
- Can I ack/nack message after the streaming pull timeout exceeds?
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
You can use side outputs.
https://cloud.google.com/dataflow/model/par-do#emitting-to-side-outputs-in-your-dofn
The following sample code, reads a BigQuery table and splits it in 3 different PCollections. Each PCollections ends up sent to a different Pub/Sub topic (which could be different BigQuery tables instead).