You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/en/guides/40-load-data/02-load-db/tapdata.md
+4-4Lines changed: 4 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ title: Tapdata
4
4
5
5
[Tapdata](https://tapdata.net) is a platform-oriented product designed for data services, aimed at helping enterprises break down multiple data silos, achieve rapid data delivery, and enhance data transfer efficiency through real-time data synchronization. We also support the creation of tasks through a low-code approach, making it easy to create tasks with a simple drag-and-drop of nodes, effectively reducing development complexity and shortening project deployment cycles.
6
6
7
-
[Databend](https://docs.tapdata.io/cloud/introduction/supported-databases#beta-%E6%95%B0%E6%8D%AE%E6%BA%90) is one of the data sources supported by Tapdata. You can use Tapdata to synchronize data from other platforms to Databend, using Databend as the **destination** for data migration/synchronization.
7
+
Databend is one of the data sources supported by Tapdata. You can use Tapdata to synchronize data from other platforms to Databend, using Databend as the **destination** for data migration/synchronization.
@@ -16,16 +16,16 @@ To establish a connection with Databend Cloud and set it as the synchronization
16
16
17
17
Tapdata Agent is a key program in data synchronization, data heterogeneity, and data development scenarios. Given the high real-time requirements for data flow in these scenarios, deploying Tapdata Agent in your local environment ensures optimal performance based on low-latency local networks to guarantee real-time data flow.
18
18
19
-
For Tapdata Agent download and installation instructions, please refer to [Step 1: Install Agent](https://docs.tapdata.io/cloud/quick-start/install-agent/).
19
+
For Tapdata Agent download and installation instructions, please refer to [Step 1: Provision TapData - Tapdata Cloud](https://docs.tapdata.io/quick-start/install/install-tapdata-agent).
20
20
21
21
### Step 2: Create Connections
22
22
23
-
You need to establish a connection for each of the data source and data destination for data synchronization. For example, if you want to synchronize data from MySQL to Databend Cloud, you need to create two connections on Tapdata Cloud—one connecting to MySQL and the other to Databend Cloud. Follow the steps outlined in [Step 2: Connect Data Sources](https://docs.tapdata.io/cloud/quick-start/connect-database) for creating connections.
23
+
You need to establish a connection for each of the data source and data destination for data synchronization. For example, if you want to synchronize data from MySQL to Databend Cloud, you need to create two connections on Tapdata Cloud—one connecting to MySQL and the other to Databend Cloud. Follow the steps outlined in [Step 2: Connect Data Sources](https://docs.tapdata.io/quick-start/connect-database) for creating connections.
24
24
25
25
Here is an example of connecting to Databend Cloud:
Once connections to the data source and Databend Cloud are established, you can begin data synchronization by creating data replication tasks. Refer to [Create a Data Replication Task](https://docs.tapdata.io/cloud/user-guide/copy-data/create-task) for the operational steps.
31
+
Once connections to the data source and Databend Cloud are established, you can begin data synchronization by creating data replication tasks. Refer to [Create a Data Replication Task](https://docs.tapdata.io/user-guide/data-pipeline/copy-data/create-task) for the operational steps.
Copy file name to clipboardExpand all lines: docs/en/guides/90-community/02-rfcs/20220805-json-optimization.md
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -40,7 +40,7 @@ Extracting and storing virtual columns requires extra parsing processes and stor
40
40
We should only generate virtual columns for frequently queried JSON key paths.
41
41
42
42
In order to know which key paths are frequently queried,
43
-
we use the [FPGrowth algorithm](https://www.cs.sfu.ca/~jpei/publications/sigmod00.pdf) to count the access frequency of the JSON data key paths.
43
+
we use the [FPGrowth algorithm](https://link.springer.com/content/pdf/10.1023/B:DAMI.0000005258.31418.83.pdf) to count the access frequency of the JSON data key paths.
44
44
"FP" stands for frequent pattern, usually used to calculate item frequencies and identify frequent items.
45
45
Every time the user queries one key path of the JSON data, it will be recorded by the FPGrowth algorithm, and finally a tree-like statistical information is generated.
46
46
We can use this statistical information to determine which key paths need to generate virtual columns.
0 commit comments