feat(Datastream): Add SQL Server (MSSQL) source support#3396
feat(Datastream): Add SQL Server (MSSQL) source support#3396pabloqc wants to merge 2 commits intoGoogleCloudPlatform:mainfrom
Conversation
|
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up to date status, view the checks section at the bottom of the pull request. |
Summary of ChangesHello @pabloqc, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request significantly enhances the Datastream-to-BigQuery pipeline by integrating comprehensive support for SQL Server as a source database. It enables seamless data replication and change data capture from SQL Server instances to BigQuery, ensuring proper schema discovery, metadata handling, and data type conversions. This expansion broadens the utility of the pipeline for users relying on SQL Server for their data warehousing needs. Highlights
Changelog
Activity
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
…-to-BigQuery pipeline Add full SQL Server CDC support including Avro format processing, sort key definitions, schema discovery via Datastream API, BigQuery metadata schema, and type conversion mappings.
01e4e94 to
8155b42
Compare
0b624da to
41b1627
Compare
Summary
Source Connection Profile Type Not Supportederror when using SQL Server as a Datastream sourceFormatDatastreamRecordToJson) for correct metadata extraction (_metadata_schema,_metadata_lsn,_metadata_tx_id)_metadata_timestamp,_metadata_lsn) used by BigQuery MERGE operationsDataStreamClient), including table/column discovery, primary key extraction, and SQL Server-to-BigQuery type conversion_metadata_lsnto BigQuery default staging table schemaTest plan
testSqlServerSortFieldsinDatastreamRowTest— verifies sort keys are["_metadata_timestamp", "_metadata_lsn"]testProcessElement_sqlServerinFormatDatastreamJsonToJsonTest— verifies JSON format processing produces correct SQL Server metadata fieldsmvn test -pl v2/datastream-common)mvn compile -pl v2/datastream-common,v2/common,v2/datastream-to-bigquery -am)