You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: rag-playground/basecamp-handbook/baseline.ipynb
+19-19Lines changed: 19 additions & 19 deletions
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
"cells": [
3
3
{
4
4
"cell_type": "markdown",
5
-
"id": "5d7d6d21",
5
+
"id": "8621e835",
6
6
"metadata": {},
7
7
"source": [
8
8
"## Baseline\n",
@@ -13,7 +13,7 @@
13
13
{
14
14
"cell_type": "code",
15
15
"execution_count": 2,
16
-
"id": "fae55e46",
16
+
"id": "a84d1f79",
17
17
"metadata": {},
18
18
"outputs": [
19
19
{
@@ -59,7 +59,7 @@
59
59
{
60
60
"cell_type": "code",
61
61
"execution_count": 4,
62
-
"id": "ea340a1b",
62
+
"id": "a2c5f874",
63
63
"metadata": {},
64
64
"outputs": [
65
65
{
@@ -85,7 +85,7 @@
85
85
{
86
86
"cell_type": "code",
87
87
"execution_count": 5,
88
-
"id": "e59b71b8",
88
+
"id": "a6519df0",
89
89
"metadata": {},
90
90
"outputs": [],
91
91
"source": [
@@ -98,7 +98,7 @@
98
98
{
99
99
"cell_type": "code",
100
100
"execution_count": 7,
101
-
"id": "5b6f4c8b",
101
+
"id": "baa208cd",
102
102
"metadata": {},
103
103
"outputs": [
104
104
{
@@ -118,7 +118,7 @@
118
118
{
119
119
"cell_type": "code",
120
120
"execution_count": 8,
121
-
"id": "821307b9",
121
+
"id": "118d251d",
122
122
"metadata": {},
123
123
"outputs": [
124
124
{
@@ -137,7 +137,7 @@
137
137
},
138
138
{
139
139
"cell_type": "markdown",
140
-
"id": "b5e20959",
140
+
"id": "e165bd7a",
141
141
"metadata": {},
142
142
"source": [
143
143
"### Running Evaluations"
@@ -146,7 +146,7 @@
146
146
{
147
147
"cell_type": "code",
148
148
"execution_count": 11,
149
-
"id": "b7f13a15",
149
+
"id": "4ced68af",
150
150
"metadata": {},
151
151
"outputs": [
152
152
{
@@ -192,7 +192,7 @@
192
192
{
193
193
"cell_type": "code",
194
194
"execution_count": 12,
195
-
"id": "70b5b3b3",
195
+
"id": "e14aafad",
196
196
"metadata": {},
197
197
"outputs": [],
198
198
"source": [
@@ -203,7 +203,7 @@
203
203
{
204
204
"cell_type": "code",
205
205
"execution_count": 13,
206
-
"id": "1a0bc417",
206
+
"id": "d5594de4",
207
207
"metadata": {},
208
208
"outputs": [
209
209
{
@@ -273,7 +273,7 @@
273
273
},
274
274
{
275
275
"cell_type": "markdown",
276
-
"id": "e7c24549",
276
+
"id": "b4529e58",
277
277
"metadata": {},
278
278
"source": [
279
279
"Seems like overall scores are good. Lets look into a bit more to see which questions gave bad results."
@@ -282,7 +282,7 @@
282
282
{
283
283
"cell_type": "code",
284
284
"execution_count": 15,
285
-
"id": "8a0ad424",
285
+
"id": "f2211f35",
286
286
"metadata": {},
287
287
"outputs": [
288
288
{
@@ -407,7 +407,7 @@
407
407
},
408
408
{
409
409
"cell_type": "markdown",
410
-
"id": "f65cb827",
410
+
"id": "57138d44",
411
411
"metadata": {},
412
412
"source": [
413
413
"With a small utility function we can see the question, answer, contexts."
@@ -416,7 +416,7 @@
416
416
{
417
417
"cell_type": "code",
418
418
"execution_count": 45,
419
-
"id": "5b94e540",
419
+
"id": "e496210c",
420
420
"metadata": {},
421
421
"outputs": [],
422
422
"source": [
@@ -451,7 +451,7 @@
451
451
{
452
452
"cell_type": "code",
453
453
"execution_count": 46,
454
-
"id": "9842219a",
454
+
"id": "ec86e750",
455
455
"metadata": {},
456
456
"outputs": [
457
457
{
@@ -492,7 +492,7 @@
492
492
},
493
493
{
494
494
"cell_type": "markdown",
495
-
"id": "d85441a1",
495
+
"id": "ad2639ba",
496
496
"metadata": {},
497
497
"source": [
498
498
"Seems like the question is a bit off and the LLM actually makes a lot of stuff to answer the questions because the retrieved context is not very good.\n",
0 commit comments