Start adding content for GenAI docs

natke · natke · commit 8e24b439edc7 · 2024-02-18T04:47:44.000-08:00
diff --git a/docs/genai/howto/install.md b/docs/genai/howto/install.md
@@ -0,0 +1,69 @@
+---
+title: Install ONNX Runtime GenAI
+description: Instructions to install ONNX Runtime GenAI on your target platform in your environment
+has_children: false
+nav_order: 1
+---
+
+# Install ONNX Runtime GenAI
+
+## Python package
+
+(Coming soon) `pip install onnxruntime-genai`
+
+(Temporary)
+1. Build from source
+
+   Follow the instructions in [build-from-source.md]
+
+2. Install wheel
+
+   ```bash
+   cd build/wheel
+   pip install onnxruntime-genai*.whl
+   ```
+
+## C# package
+
+(Coming soon) `dotnet add package Microsoft.ML.OnnxRuntime.GenAI`
+
+(Temporary)
+1. Build from source
+
+   Follow the instructions in [build-from-source.md]
+
+2. Build nuget package
+
+   ```cmd
+   nuget.exe pack Microsoft.ML.OnnxRuntimeGenAI.nuspec -Prop version=0.1.0 -Prop id="Microsoft.ML.OnnxRuntimeGenAI.Gpu"
+   ```
+
+3. Install the nuget package
+
+   ```cmd
+   dotnet add package .. local instructions
+   ```
+
+
+## C artifacts
+
+(Coming soon) Download release archive
+
+Unzip archive
+
+(Temporary)
+1. Build from source
+
+   Follow the instructions in [build-from-source.md]
+
+   
+2. Use the following include locations to build your C application
+
+   * 
+
+3. Use the following library locations to build your C application
+
+   * 
+
+   
+
diff --git a/docs/genai/index.md b/docs/genai/index.md
@@ -0,0 +1,9 @@
+# Generative AI with ONNX Runtime
+
+Run generative AI models with ONNX Runtime.
+
+This library provides the generative AI loop for ONNX models, including inference with ONNX Runtime, logits processing, search and sampling, and KV cache management.
+
+Users can call a high level `generate()` method, or run each iteration of the model in a loop, generating one token at a time, and optionally updating generation parameters inside the loop.
+
+It has support for greedy/beam search and TopP, TopK sampling to generate token sequences and built-in logits processing like repetition penalties. You can also easily add custom scoring.