Sierra Chart Scid to PostgreSQL

Overview

This project reads Sierra Chart SCID files and writes the data to a PostgreSQL database. The data includes timestamp, price, quantity, and market side (bid/ask).

Prerequisites

Python 3.8+
PostgreSQL 12+
Required Python libraries:
- asyncpg
- polars
- numpy
- pandas
- pytz

You can install the required libraries using pip:

pip install asyncpg polars numpy pandas pytz

PostgreSQL Setup

Ensure you have a PostgreSQL database available. You will need the following details:

Host: e.g., localhost
Port: e.g., 5432
Username: e.g., your_username
Password: e.g., your_password
Database name: e.g., your_database

Project Files

db_create.py This script sets up the PostgreSQL table to store SCID data. Update the PostgreSQL connection details and the table name in this script.
checkpoint.json This file tracks the last processed position in the SCID file and whether the initial load is completed.
data_sync.py This script reads the SCID file and updates the PostgreSQL table at regular intervals. Update the table name, SCID file path, and other parameters as needed.

Usage

Step 1: Set Up the PostgreSQL Table

Update the PostgreSQL connection details and table name in db_create.py:

# Update these lines with your PostgreSQL credentials
host="localhost",
port="5432", 
user="your_username",
password="your_password",
database="your_database"

# Default table name is "esm24", but should be modified to match the specific contract symbol you are working with.
CREATE TABLE IF NOT EXISTS "esm24"

Run db_create.py to set up the database table:
```
python db_create.py
```

Step 2: Initialize the Checkpoint File

Ensure checkpoint.json is set up for the initial load. The table name in checkpoint.json should match the table name you set in db_create.py. For the first run, use the following configuration:
```
{"esm24": {"last_position": 0, "initial_load_done": false}}
```
Replace "esm24" with your specific table name if it's different.

Step 3: Start Data Synchronization

Update the table name and SCID file path in data_sync.py:

 table_name = "esm24"  # Specify the unique table name for your data.
 scid_file = "/Volumes/[C] Windows 11/Sierra/Data/ESM24-CME.scid"  # Set the file path to your SCID file.

Set the update interval in data_sync.py to continuously update data from the SCID file. Here, "1" means pause the execution for 1 second between updates:

while True:
asyncio.run(main(table_name, scid_file, initial_load=False))
time.sleep(1)  # Pause for 1 second before the next update. Adjust as needed.

Run data_sync.py to start synchronizing data (the initial run may take some time depending on the SCID file size and your system performance etc., please be patient):
```
python data_sync.py
```

Data Validation

To validate the data, use the provided Jupyter Notebook data_check.ipynb. This notebook connects to the PostgreSQL database, retrieves data, and performs basic validation checks.

Example Output:

min value:  4963.5
max value:  5333.5
min value of quantity:  1
max value of quantity:  2962
min value of side:  0
max value of side:  1
min value of actual_datetime:  2024-02-22 18:07:25.394000-06:00
max value of actual_datetime:  2024-05-10 15:59:59.478001-05:00
total rows count:  43818522
Number of null entries:
 price       0
quantity    0
side        0
dtype: int64
Unique values in 'side':  [1 0]
Distribution of 'price' values:  price
5211.75    95325
5212.50    95262
5211.50    93263
5212.00    90905
5212.75    90141
           ...  
4963.75      173
5330.75      126
5333.25      113
4963.50       34
5333.50        4
Name: count, Length: 1481, dtype: int64

Example Plot:

Here is an example plot of the price distribution.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
docs		docs
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
checkpoint.json		checkpoint.json
data_check.ipynb		data_check.ipynb
data_sync.py		data_sync.py
db_create.py		db_create.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sierra Chart Scid to PostgreSQL

Overview

Prerequisites

PostgreSQL Setup

Project Files

Usage

Step 1: Set Up the PostgreSQL Table

Step 2: Initialize the Checkpoint File

Step 3: Start Data Synchronization

Data Validation

About

Releases

Packages

Languages

License

n1c0la5-lab/SierraChartScidToPostgres

Folders and files

Latest commit

History

Repository files navigation

Sierra Chart Scid to PostgreSQL

Overview

Prerequisites

PostgreSQL Setup

Project Files

Usage

Step 1: Set Up the PostgreSQL Table

Step 2: Initialize the Checkpoint File

Step 3: Start Data Synchronization

Data Validation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages