Verification and Correction
BladePipe provides a robust data verification and correction feature. It finds differences between your source and target databases. It can also fix missing or inconsistent data automatically.
BladePipe runs a second verification check. This helps reduce errors caused by latency, and ensures high data accuracy.
This page describes how to create and manage data verification and correction DataJobs. You can configure one-time DataJobs, reliable periodic verification DataJobs, custom code evaluations, or verification subtasks.
Create a One-time DataJob
To create a one-time data correction pipeline, follow these steps:
- Log in to BladePipe.
- In the navigation bar, click DataJob > Create DataJob.
- Select the source and target DataSource, and then click Next.
- Configure the DataJob:
- For DataJob Type, select Verification and Correction.
- For Verification, select One-time.
- Choose your Correction Mode:
- Revise after Check: BladePipe automatically corrects the data after the verification task completes.
- NONE: BladePipe does not automatically correct the data. To manually trigger a correction, go to the DataJob Details page and click Functions > Create Correction DataJob.
- Opt to start the DataJob automatically or save it for later. By default, the DataJob starts automatically upon creation.
- After completing the configuration, click Next.
- Select the tables to be verified. You can only select existing tables.
- Select the columns to be verified. You can deselect the columns you do not want to check.
- Click Create DataJob.
- View the job progress in the DataJob list.
Create a Scheduled DataJob
- Log in to BladePipe.
- In the navigation bar, click DataJob > Create DataJob.
- Select the source and target DataSource, and then click Next.
- Configure the DataJob:
- For DataJob Type, select Verification and Correction.
- For Verification, select Scheduled, and configure the job execution cycle.
- Choose your Correction Mode:
- Revise after Check: BladePipe automatically corrects the data after the verification task completes.
- NONE: BladePipe does not automatically correct the data. To manually trigger a correction, go to the DataJob Details page and click Functions > Create Correction DataJob.
- Opt to start the DataJob automatically or save it for later. By default, the DataJob starts automatically upon creation.
- After completing the configuration, click Next.
- Select the tables to be verified. You can only select existing tables.
- Select the columns to be verified. You can deselect any column you do not want to check for data accuracy.
- Click Create DataJob.
- View the job progress in the DataJob list.
Manage a Scheduled DataJob
Manage DataJob Execution Time
In the DataJob list, you can view the job progress and its next execution time. BladePipe automatically starts the data sync error checking when the scheduled time arrives.
To run the verification process immediately, click Execute Now in the operation column. The DataJob will start within two minutes.