Skip to content

Commit 769810d

Browse files
authored
Merge pull request #727 from rabwill/model-blog-01
updates
2 parents a1c9fff + 2d453c5 commit 769810d

File tree

1 file changed

+60
-6
lines changed

1 file changed

+60
-6
lines changed

docs/pages/beyond-agents/model-selection.md

Lines changed: 60 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,23 @@ Think of these models like a kitchen full of specialized tools. You wouldn't use
6464

6565
Now comes the fun part where you will actually test these models! The GitHub Models Playground is your sandbox for experimentation. Here's where the magic happens.
6666

67-
> You'll need a [GitHub account](https://github.com/signup) to proceed.
67+
### Setup Requirements
68+
69+
**Prerequisites:**
70+
1. **GitHub Account**: [Create one free](https://github.com/signup) if needed
71+
2. **Access Verification**: Visit [GitHub Models Marketplace](https://github.com/marketplace/models)
72+
3. **Catalog Familiarity**: Browse [available models](https://github.com/marketplace?type=models)
73+
74+
**Navigation Strategy:**
75+
- **Filter by Publisher**: Focus on established AI providers
76+
- **Filter by Capability**: Select `Chat/Completion` for text tasks
77+
- **Filter by Category**: Choose based on your needs:
78+
- **All**: General question-answering
79+
- **Instruction**: Specialized domains
80+
- **Multimodal**: Image and text processing
81+
- **Audio**: Speech processing
82+
- **Reasoning**: Complex problem-solving
83+
- **Multilingual**: Multiple language support
6884

6985
### Getting Started
7086

@@ -113,15 +129,53 @@ The playground removes the guesswork and you see real results in real time.
113129

114130
---
115131

116-
## 🔄 Step 4: Review, Iterate, and Improve
132+
## 🔄 Step 4: Continuous Optimization Strategy
117133

118-
Model selection isn't a one-and-done decision. It's a process of continuous refinement.
134+
Model selection requires ongoing refinement as your needs evolve and new models become available.
119135

136+
### Optimization Approach
120137

121-
## 🚀 Bonus step: Cost and other important stuff
138+
**Initial Implementation:**
139+
Choose your best-performing model based on testing results and deploy it for regular use.
122140

123-
This final step is a bonus, but powerful.
124-
You can compare costs, quality and other things for models using [leaderboard](https://ai.azure.com/explore/models/leaderboard)
141+
**Performance Monitoring:**
142+
Track real-world performance over time. Note any patterns where results don't meet expectations.
143+
144+
**Periodic Evaluation:**
145+
Quarterly, test new or updated models against your current choice using your standard test cases.
146+
147+
**Strategic Adjustment:**
148+
Update your model selection when you find measurably better performance for your specific use cases.
149+
150+
### Advanced Considerations
151+
152+
**Cost-Benefit Analysis:**
153+
Evaluate whether premium models justify their cost through improved efficiency or quality that saves time or delivers better outcomes.
154+
155+
**Edge Case Management:**
156+
Maintain a collection of challenging requests that reveal model limitations. Use these for testing new models.
157+
158+
**Performance Documentation:**
159+
Keep records of what works well for different scenarios. This knowledge base becomes invaluable for future decisions.
160+
161+
---
162+
163+
## 🚀 Practical Considerations
164+
165+
### Cost and Performance Analysis
166+
167+
Use the [Azure AI Model Leaderboard](https://ai.azure.com/explore/models/leaderboard) to compare:
168+
- **Cost per request**: Budget planning and ROI calculation
169+
- **Performance metrics**: Objective quality measurements
170+
- **Speed benchmarks**: Response time requirements
171+
172+
### Professional Tips
173+
174+
**Efficiency Focus**: Most tasks work well with mid-tier models. Reserve premium options for scenarios where quality differences significantly impact outcomes.
175+
176+
**Documentation Practice**: Maintain simple records of successful model-task combinations for future reference.
177+
178+
**Stay Current**: Test new models regularly as capabilities and options evolve rapidly.
125179

126180
## 📚 Resources
127181

0 commit comments

Comments
 (0)