This is the backend of Karp, SBX's tool for managing lexical data and other structured data.
The basic structure in Karp is a resource, which is a collection of entries. Each resource may be configured according to the format of the data and other needs.
The backend consists of two parts: the command-line interface (CLI), used to manage resources, and the web API for modifying and querying the data.
There is also a frontend; contact SBX for more information.
Follow the steps in getting started.
Use the CLI to create or modify resources, publish resources and do bulk editing. To view the CLI documentation, use:
```
karp-cli --help
```
The resource configuration is documented here.
There is also a tutorial describing creation of a resource.
The API documentation for the current version is available here.
Using the API (with credentials) one can:
- add an entry to a resource
- modify existing entries
- delete an entry from a resource (a soft delete: the entry is discarded, but the actual data is retained)
All edits are stored, along with time and the editor. The history of an entry is also available through the API.
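As a hypothetical illustration of an edit through the API (the endpoint path, payload shape, and auth header below are assumptions for the sketch, not taken from the API docs — see the linked API documentation for the real routes):

```shell
# Sketch: add an entry to the 'places' resource with credentials.
# The URL path, JSON shape, and $JWT token are illustrative assumptions.
curl -s -X POST "http://localhost:8000/entries/places" \
  -H "Authorization: Bearer $JWT" \
  -H "Content-Type: application/json" \
  -d '{"entry": {"name": "Berga", "population": 1234}}'
```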
Searching is done with our custom query language.
Searching supports sorting and pagination.
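A hypothetical query might look like the following (the endpoint path, field names, and parameter names are assumptions for illustration; consult the query-language documentation for the actual syntax):

```shell
# Sketch: search 'places' for entries whose 'name' field equals 'Berga',
# sorted by name, returning the first 25 hits. The path and the
# q/sort/from/size parameter names are illustrative assumptions.
curl -s "http://localhost:8000/query/places?q=equals|name|Berga&sort=name&from=0&size=25"
```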
This is version 7 of the Karp backend; the legacy version is v5.
We use MariaDB for storage and Elasticsearch for search.
This project uses Poetry.
A Makefile is provided to simplify tasks.
- First clone this repo: `git clone` (or `gh repo clone` if using the GitHub CLI).
- Install dependencies: `make dev` (or `make install-dev`).
- Install MariaDB and create a database.
- Set up environment variables (these can be placed in a `.env` file in the project root; `poetry run` then sets them):

  ```
  export DB_DATABASE=<name of database>
  export DB_USER=<database user>
  export DB_PASSWORD=<user's password>
  export DB_HOST=localhost
  export AUTH_JWT_PUBKEY_PATH=/path/to/pubkey
  ```

- Activate the virtual environment by running: `poetry shell`
- Run `karp-cli db up` to initialize the database.
- Run `make serve` or `make serve-w-reload` to start the development server, or run `poetry shell` and then `uvicorn --factory karp.karp_v6_api.main:create_app`.
- To set up Elasticsearch, download Elasticsearch 8.x and run the following command from the `elasticsearch-8.XXX` directory: `bin/elasticsearch-plugin install analysis-icu`. Then run `bin/elasticsearch -Expack.security.enabled=false` to start it.
- Add environment variables: `export ELASTICSEARCH_HOST=http://localhost:9200`
- Run `poetry shell` and then:

  ```
  karp-cli resource create assets/testing/config/places.yaml
  karp-cli entries add places assets/testing/data/places.jsonl
  ```

  Do the same for `municipalities`, then publish both resources:

  ```
  karp-cli resource publish places 1
  karp-cli resource publish municipalities 1
  ```
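The "create a database" step above can be done from the MariaDB client. The database name, user, and password below are placeholders matching the `DB_*` environment variables, not values from this repo:

```shell
# Sketch: create a database and a user for Karp (names and password are
# placeholders -- use your own and mirror them in the DB_* variables).
sudo mysql <<'SQL'
CREATE DATABASE karp;
CREATE USER 'karp'@'localhost' IDENTIFIED BY 'secret';
GRANT ALL PRIVILEGES ON karp.* TO 'karp'@'localhost';
FLUSH PRIVILEGES;
SQL
```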
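After starting Elasticsearch, a quick sanity check (assuming the default port 9200, as in `ELASTICSEARCH_HOST` above) is:

```shell
# Check that Elasticsearch answers and that the ICU plugin is installed.
curl -s http://localhost:9200
curl -s http://localhost:9200/_cat/plugins
```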
- Python >= 3.10
- Poetry >= 1.3
- FastAPI
- SQLAlchemy
- Typer
- Elasticsearch
- Elasticsearch DSL
- MariaDB
- Elasticsearch
Run type checking with `make type-check` or just `basedpyright`.
We use basedpyright, which is like Pyright but without a Node.js dependency.
Currently, actual type checking is only done on selected files, but basedpyright provides "syntax and semantic errors" for all files.
The tests are organized into unit, integration, and end-to-end tests.
Unit tests should have no infrastructure dependencies and should run fast.
Run them with: `make test` (or `make unit-tests`)
Integration tests have some infrastructure dependencies and run slower.
Run them with: `make integration-tests`
End-to-end tests have all infrastructure dependencies and run the slowest.
Run them with: `make e2e-tests`
Running all tests requires all infrastructure dependencies and is the slowest; it also starts with a type-checking pass.
Run them with: `make all-tests`
Linting and formatting are done by Ruff.
Run the linter with `make lint`; settings are in `ruff.toml`.
Run the formatter with `make fmt`; to check whether formatting is needed, run `make check-fmt`.
Useful ruff commands:
- `ruff --fix <path>` tries to fix linting problems.
- `ruff --add-noqa <path>` adds `noqa` comments (silencing lint) to each line that needs one.
Update the version in the following files: