|
| 1 | +# Querying |
| 2 | + |
| 3 | +The Python Elasticsearch client provides several ways to send queries to Elasticsearch. This document explains the details of how to construct and execute queries using the client. This document does not cover the DSL module. |
| 4 | + |
| 5 | +## From API URLs to function calls |
| 6 | + |
| 7 | +Elasticsearch APIs are grouped by namespaces. |
| 8 | + |
| 9 | + * There's the global namespace, with APIs like the Search API (`GET _search`) or the Index API (`PUT /<target>/_doc/<_id>` and related endpoints). |
| 10 | + * Then there are all the other namespaces, such as: |
| 11 | + * Indices with APIs like the Create index API (`PUT /my-index`), |
| 12 | + * ES|QL with the Run an ES|QL query API (`POST /_async`), |
| 13 | + * and so on. |
| 14 | + |
| 15 | +As a result, when you know which namespace and function you need, you can call the function. Assuming that `client` is an Elasticsearch instance, here is how you would call the examples from above: |
| 16 | + |
| 17 | +* Global namespace: `client.search(...)` and `client.index(...)` |
| 18 | +* Other namespaces: |
| 19 | + * Indices: `client.indices.create(...)` |
| 20 | + * ES|QL: `client.esql.query(...)` |
| 21 | + |
| 22 | +How can you figure out the namespace? |
| 23 | + |
| 24 | +* The [Elasticsearch API docs](https://www.elastic.co/docs/api/doc/elasticsearch/) can help, even though the tags it uses do not fully map to namespaces. |
| 25 | +* You can also use the client documentation, by: |
| 26 | + * browsing the [Elasticsearch API Reference](https://elasticsearch-py.readthedocs.io/en/stable/api.html) page, or |
| 27 | + * searching for your endpoint using [Read the Docs](https://elasticsearch-py.readthedocs.io/) search, which is powered by Elasticsearch! |
| 28 | +* Finally, for Elasticsearch 8.x, most examples in the [Elasticsearch guide](https://www.elastic.co/guide/en/elasticsearch/reference/current/index.html) are also available in Python. (This is still a work in progress for Elasticsearch 9.x.) In the example below, `client.ingest.put_pipeline(...)` is the function that calls the "Create or update a pipeline" API. |
| 29 | + |
| 30 | + |
| 31 | +:::{image} ../images/python-example.png |
| 32 | +:alt: Python code example in the Elasticsearch guide |
| 33 | +::: |
| 34 | + |
| 35 | +## Parameters |
| 36 | + |
| 37 | +Now that you know which functions to call, the next step is parameters. To avoid ambiguity, the Python Elasticsearch client mandates keyword arguments. To give an example, let's look at the ["Create an index" API](https://elasticsearch-py.readthedocs.io/en/stable/api/indices.html#elasticsearch.client.IndicesClient.create). There's only one required parameter, `index`, so the minimal form looks like this: |
| 38 | + |
| 39 | +```python |
| 40 | +from elasticsearch import Elasticsearch |
| 41 | + |
| 42 | +client = Elasticsearch("http://localhost:9200", api_key="...") |
| 43 | + |
| 44 | +client.indices.create(index="my-index") |
| 45 | +``` |
| 46 | + |
| 47 | +You can also use other parameters, including the first level of body parameters, such as: |
| 48 | + |
| 49 | +```python |
| 50 | +resp = client.indices.create( |
| 51 | + index="logs", |
| 52 | + aliases={"logs-alias": {}}, |
| 53 | + mappings={"name": {"type": "text"}}, |
| 54 | +) |
| 55 | +print(resp) |
| 56 | +``` |
| 57 | + |
| 58 | +In this case, the client will send to Elasticsearch the following JSON body: |
| 59 | + |
| 60 | +```console |
| 61 | +PUT /logs |
| 62 | +{ |
| 63 | + "aliases": {"logs-alias": {}}, |
| 64 | + "mappings": {"name": {"type": "text"}} |
| 65 | +} |
| 66 | +``` |
| 67 | + |
| 68 | +## Unknown parameters or APIs |
| 69 | + |
| 70 | +Like other clients, the Python Elasticsearch client is generated from the [Elasticsearch specification](https://github.com/elastic/elasticsearch-specification). While we strive to keep it up to date, it is not (yet!) perfect, and sometimes body parameters are missing. In this case, you can specify the body directly, as follows: |
| 71 | + |
| 72 | +```python |
| 73 | +resp = client.indices.create( |
| 74 | + index="logs", |
| 75 | + body={ |
| 76 | + "aliases": {"logs-alias": {}}, |
| 77 | + "mappings": {"name": {"type": "text"}}, |
| 78 | + "missing_parameter": "foo", |
| 79 | + } |
| 80 | +) |
| 81 | +print(resp) |
| 82 | +``` |
| 83 | + |
| 84 | +In the event where an API is missing, you need to use the low-level `perform_request` function: |
| 85 | + |
| 86 | +```python |
| 87 | +resp = client.perform_request( |
| 88 | + "PUT", |
| 89 | + "/logs" |
| 90 | + index="logs", |
| 91 | + headers={"content-type": "application/json", "accept": "application/json"}, |
| 92 | + body={ |
| 93 | + "aliases": {"logs-alias": {}}, |
| 94 | + "mappings": {"name": {"type": "text"}}, |
| 95 | + "missing_parameter": "foo", |
| 96 | + } |
| 97 | +) |
| 98 | +print(resp) |
| 99 | +``` |
| 100 | + |
| 101 | +One benefit of this function is that it lets you use arbitrary headers, such as the `es-security-runas-user` header used to [impersonate users](https://www.elastic.co/guide/en/elasticsearch/reference/current/run-as-privilege.html). |
| 102 | + |
| 103 | + |
| 104 | +## Options |
| 105 | + |
| 106 | +You can specify options such as request timeouts or retries using the `.options()` API, see the [Configuration](./configuration.md) page for details. |
0 commit comments