explore-analyze/elastic-inference/eis.md (8 additions & 5 deletions)
@@ -75,17 +75,22 @@ To track your token consumption:
 1. Navigate to [**Billing and subscriptions > Usage**](https://cloud.elastic.co/billing/usage) in the {{ecloud}} Console
 2. Look for line items where the **Billing dimension** is set to "Inference"
 
+### Fair usage during free trial
+
+Accounts in the free trial period are subject to token limits that are considered "fair usage". Access to some models may be paused temporarily if this limit is exceeded.
+
+Fair usage limits while account is in free trial:
+-**Elastic Managed LLM:** 100 million input tokens in 24h or 5 million output tokens in 24h
+-**ELSER**: 1 billion tokens in 24h
+
 ## Rate limits
 
 The service enforces rate limits on an ongoing basis. Exceeding a limit will result in HTTP 429 responses from the server until the sliding window moves on further and parts of the limit resets.
 
-Accounts in the free trial period are subject to token limits that are considered "fair usage". Access to some models may be paused temporarily if this limit is exceeded.
-
 ### Elastic Managed LLM
 
 - 50 requests per minute
 - No rate limit on tokens
-- Fair usage limit while account is in free trial: 100 million input tokens in 24h or 5 million output tokens in 24h
 
 ### ELSER (Sparse Embeddings)
 
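Note on the "Rate limits" paragraph in the hunk above: since the service answers with HTTP 429 until the sliding window moves on, a client would typically wait and retry. The following is a minimal sketch of that idea, not the documented client behavior. `call_with_backoff`, its parameters, and the `send` callable (assumed to return a `(status, body)` tuple) are hypothetical names introduced for illustration, and the exponential delay schedule is an assumption rather than something the service prescribes.

```python
import time
from typing import Callable, Tuple


def call_with_backoff(
    send: Callable[[], Tuple[int, str]],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Call send() until it returns a non-429 status or attempts run out.

    `send` is a hypothetical callable that performs one request and
    returns (http_status, body). It is not part of any Elastic client.
    """
    status, body = send()
    for attempt in range(1, max_attempts):
        if status != 429:
            break
        # Assumed schedule: base_delay, 2 * base_delay, 4 * base_delay, ...
        sleep(base_delay * 2 ** (attempt - 1))
        status, body = send()
    return status, body
```

Passing `sleep` as a parameter keeps the sketch testable without real waiting. If the last attempt still returns 429, the function hands that response back to the caller instead of raising.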
@@ -95,10 +100,8 @@ We limit on both requests per minute and tokens per minute (whichever limit is r
 
 - 6,000 request per minute
 - 6,000,000 tokens per minute
-- Fair usage limit while account is in free trial: 1 billion tokens in 24h
 
 #### Search
 
 - 6,000 requests per minute
 - 600,000 tokens per minute
-- Fair usage limit while account is in free trial: 1 billion tokens in 24h
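
Note on the ELSER limits in the hunk above: because both a requests-per-minute and a tokens-per-minute limit apply, a caller may want to track both budgets locally before sending. The sketch below is a client-side approximation only. `SlidingWindowBudget` and `try_acquire` are hypothetical names, the 60-second window is an assumption, and the way the service itself counts requests and tokens is not described in this diff.

```python
from collections import deque


class SlidingWindowBudget:
    """Client-side estimate of a dual (requests, tokens) sliding-window limit."""

    def __init__(self, max_requests: int, max_tokens: int, window_seconds: float = 60.0):
        self.max_requests = max_requests
        self.max_tokens = max_tokens
        self.window_seconds = window_seconds
        self._events = deque()  # (timestamp, tokens) for each accepted request
        self._tokens_in_window = 0

    def try_acquire(self, tokens: int, now: float) -> bool:
        """Return True and record the request if it fits both budgets at time `now`."""
        # Forget requests that have left the window.
        while self._events and now - self._events[0][0] >= self.window_seconds:
            _, old_tokens = self._events.popleft()
            self._tokens_in_window -= old_tokens
        # Whichever limit is reached first blocks the request.
        if len(self._events) + 1 > self.max_requests:
            return False
        if self._tokens_in_window + tokens > self.max_tokens:
            return False
        self._events.append((now, tokens))
        self._tokens_in_window += tokens
        return True


# Example using the "Search" numbers quoted in the diff.
search_budget = SlidingWindowBudget(max_requests=6_000, max_tokens=600_000)
```

Timestamps are passed in explicitly so the behavior is deterministic. In real use a caller would likely pass `time.monotonic()` and still handle HTTP 429 responses, since the local estimate can drift from the server's view.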