congressional-record

This tool converts HTML files containing the text of the Congressional Record into structured text data. It is particularly useful for identifying speeches by members of Congress.

From the repository root, type python -m congressionalrecord.cli -h for instructions.

It outputs JSON.
Instances of speech are tagged with the speaker's Bioguide ID (bioguideid) wherever possible.
Instances of speech are recorded as "turns," such that each subsequent instance of speech by a Member counts as a new "turn."
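
As a rough illustration of working with that output, the sketch below groups speech text by speaker ID. The sample document and its field names ("content", "kind", "speaker_bioguide", "turn", "text") are assumptions made for this example, as are the made-up IDs; inspect the JSON your own run produces before relying on any of them.

```python
import json
from collections import defaultdict

# Hypothetical sample shaped like one parsed document. The field names and
# the IDs below are assumptions for illustration, not a documented schema.
SAMPLE = """
{
  "content": [
    {"kind": "speech", "speaker_bioguide": "A000001", "turn": 0, "text": "First remarks."},
    {"kind": "recorder", "speaker_bioguide": null, "turn": -1, "text": "A procedural note."},
    {"kind": "speech", "speaker_bioguide": "B000002", "turn": 1, "text": "A reply."},
    {"kind": "speech", "speaker_bioguide": "A000001", "turn": 2, "text": "A rejoinder."}
  ]
}
"""


def turns_by_speaker(document):
    """Group speech text by speaker ID, keeping turns in document order."""
    grouped = defaultdict(list)
    for item in document.get("content", []):
        speaker_id = item.get("speaker_bioguide")
        # Skip non-speech items and speeches with no matched speaker ID.
        if item.get("kind") != "speech" or not speaker_id:
            continue
        grouped[speaker_id].append(item["text"])
    return dict(grouped)


if __name__ == "__main__":
    doc = json.loads(SAMPLE)
    for speaker_id, texts in turns_by_speaker(doc).items():
        print(speaker_id, len(texts))
```

Because speeches are only tagged "wherever possible," the sketch drops items without an ID rather than guessing at the speaker.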

This software is released as-is under the BSD 3-Clause License, with no warranty of any kind.

installation

Clone the repository and change into its directory:

git clone https://github.com/unitedstates/congressional-record.git
cd congressional-record

With Python 3, using venv for example:

python3 -m venv .venv
.venv/bin/python -m pip install -e .

Then run .venv/bin/python -m congressionalrecord.cli -h to see usage instructions.

If you use uv with Python 3, run:

uv sync

Then use uv run python -m congressionalrecord.cli -h to see usage instructions.

Recommended citation:

Judd, Nicholas, Dan Drinkard, Jeremy Carbaugh, and Lindsay Young. congressional-record: A parser for the Congressional Record. Chicago, IL: 2017.

