🐟 Troutlytics Backend

Description

Troutlytics is a data-driven Python application that scrapes and stores trout stocking data for Washington State lakes. It runs on a scheduled AWS Fargate task and stores results in an Aurora PostgreSQL database for use in dashboards, maps, and analysis tools.

📦 Project Structure

.
├── api/                          # 🎯 Main application API
│   ├── __init__.py              # API package initializer
│   ├── index.py                 # API entrypoint (routes/controllers)
│   ├── requirements.txt         # API dependencies
│   ├── dockerfiles/
│   │   ├── dev/Dockerfile       # Dev Dockerfile
│   │   └── prod/                # Production Dockerfile (Lambda-ready)
│   │       ├── Dockerfile
│   │       └── lambda_entry_script.sh
│   └── README.md                # API-specific usage docs

├── web_scraper/                 # 🕸️ Web scraping service
│   ├── __init__.py
│   ├── scraper.py              # Main script for collecting trout/creel data
│   ├── Dockerfile              # Docker setup for scraper
│   ├── Makefile                # Shortcuts for common dev tasks
│   ├── requirements.txt
│   ├── README.md
│   └── tests/                  # 🔬 Pytest-based tests
│       ├── __init__.py
│       └── test_scraper.py

├── data/                        # 🗃️ Database models and storage
│   ├── __init__.py
│   ├── database.py             # SQLAlchemy engine and session config
│   ├── models.py               # ORM models for tables
│   ├── backup_data.sql         # SQL dump for backup or restore
│   ├── backup_data.txt         # Raw text backup
│   └── sqlite.db               # Local development database

├── aws_config/                 # ☁️ AWS deployment and secrets setup
│   ├── configure-aws-credentials-latest.yml  # GitHub Actions for AWS login
│   └── fargate-rds-secrets.yaml              # Fargate setup with RDS and Secrets Manager served there)
├── README.md              # You are here 📘

⸻

🚀 Deployment Overview

AWS Infrastructure:

Fargate runs the scraper every 24 hours via EventBridge.
Secrets Manager securely stores DB credentials.
Aurora PostgreSQL stores structured stocking data.
CloudWatch Logs tracks runtime output for visibility.
API hosted with API Gateway and Lambda

GitHub → ECR Workflow:

Automatically builds and pushes Docker image on main branch updates.
Uses secure OIDC GitHub Actions role to push to ECR.

⸻

📋 Prerequisites

An AWS Account configured with appropriate permissions
AWS CLI configured with appropriate permissions
Docker installed (for local and prod builds)
Python 3.11+

⸻

🧪 Run Locally

🚀 Docker Compose Commands Cheat Sheet

Everything is ran from the root repo folder

Action	Command	Notes
Build all services	`docker compose build`	Build all images
Start all services	`docker compose up`	Start API(s), Scraper
Start all + rebuild	`docker compose up --build`	Force rebuild before starting
Start dev API only	`docker compose up api-dev`	Starts API Dev service
Start prod API only	`docker compose up api-prod`	Starts API Prod service
Start scraper only	`docker compose up web-scraper`	Starts Scraper
Stop all services	`docker compose down`	Stops and removes all containers and networks
Rebuild dev API only	`docker compose build api-dev`	Rebuild only the dev API image
Rebuild prod API only	`docker compose build api-prod`	Rebuild only the prod API image
Rebuild scraper only	`docker compose build web-scraper`	Rebuild only the scraper image
View running containers	`docker compose ps`	Show status of all services
View logs (all services)	`docker compose logs`	View logs for all services
Follow logs live	`docker compose logs -f`	Stream logs in real time
Stop dev API	`docker compose stop api-dev`	Stop only the dev API container
Stop prod API	`docker compose stop api-prod`	Stop only the prod API container
Stop scraper	`docker compose stop web-scraper`	Stop only the scraper container
Restart all containers	`docker compose restart`	Restart all running services

🛠️ Cloud Setup

Deploy the CloudFormation Stack:

aws cloudformation deploy \
  --template-file aws_config/configure-aws-credentials-latest.yml \
  --stack-name troutlytics-stack \
  --capabilities CAPABILITY_NAMED_IAM \
  --parameter-overrides \
    ECRImageUriScraper=123456789012.dkr.ecr.us-west-2.amazonaws.com/scraper:latest \
    ECRImageUriAPI=123456789012.dkr.ecr.us-west-2.amazonaws.com/api:latest \
    VpcId=vpc-xxxxxxxx \
    SubnetIds=subnet-aaaa,subnet-bbbb \
    SecurityGroupId=sg-xxxxxxxx

⸻

🔐 GitHub → ECR Deploy (CI/CD)

To enable GitHub Actions auto-deploy:

Deploy the github_oidc_ecr_access.yaml CloudFormation template.
Add the output IAM Role ARN to your GitHub Actions secrets or workflows.
Push to main — your image builds and publishes to ECR automatically.

⸻

📈 Roadmap Ideas

Add support for weather/streamflow overlays
Enable historical trend analysis by lake
Integrate public stocking alerts
Expand scraper coverage to other regions or species

⸻

🧠 Credits

Created by @thomas-basham — U.S. Army veteran, full-stack developer, and passionate angler 🎣

⸻

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 510 Commits
.github		.github
api		api
aws_config		aws_config
data		data
web_scraper		web_scraper
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🐟 Troutlytics Backend

Description

📦 Project Structure

🚀 Docker Compose Commands Cheat Sheet

🛠️ Cloud Setup

About

Uh oh!

Uh oh!

Languages

License

troutlytics/troutlytics-backend

Folders and files

Latest commit

History

Repository files navigation

🐟 Troutlytics Backend

Description

📦 Project Structure

🚀 Docker Compose Commands Cheat Sheet

🛠️ Cloud Setup

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages