File tree
267 files changed
+4505
-4898
lines changed- .github/workflows
- benchmarks
- cpp
- utils
- python
- cpp
- include/tensorrt_llm
- batch_manager
- executor
- runtime
- tensorrt_llm
- batch_manager
- aarch64-linux-gnu
- x86_64-linux-gnu
- common
- cutlass_extensions/include/cutlass_extensions/gemm
- collective
- builders
- kernel
- executor
- aarch64-linux-gnu
- x86_64-linux-gnu
- kernels
- contextFusedMultiHeadAttention
- cutlass_kernels
- fpA_intB_gemm/launchers
- decoderMaskedMultiheadAttention
- decoderXQAImplJIT
- nvrtcWrapper
- aarch64-linux-gnu
- x86_64-linux-gnu
- internal_cutlass_kernels
- aarch64-linux-gnu
- x86_64-linux-gnu
- selectiveScan
- speculativeDecoding
- unfusedAttentionKernels
- userbuffers
- weightOnlyBatchedGemv
- layers
- plugins
- gptAttentionCommon
- gptAttentionPlugin
- ncclPlugin
- pybind
- batch_manager
- executor
- runtime
- thop
- tests
- kernels
- allReduce
- resources/scripts
- runtime
- utils
- docs/source
- advanced
- installation
- reference
- examples
- baichuan
- bert
- bindings/executor
- bloom
- chatglm
- commandr
- dbrx
- deepseek_v1
- dit
- draft_target_model
- eagle
- enc_dec
- falcon
- gemma
- gptj
- gptneox
- gpt
- grok
- internlm
- jais
- llama
- llm-api
- lookahead
- mamba
- medusa
- mixtral
- mllama
- mpt
- multimodal
- nemotron
- openai_triton
- manual_plugin
- plugin_autogen
- opt
- phi
- prompt_lookup
- python_plugin
- quantization
- qwenvl
- qwen
- recurrentgemma
- redrafter
- sdxl
- skywork
- smaug
- whisper
- tensorrt_llm
- auto_parallel
- tensor_parallel
- plugin_nodes
- bench
- build
- dataclasses
- utils
- layers
- llmapi
- models
- eagle
- gemma
- utils
- llama
- mllama
- nemotron_nas
- phi3
- qwen
- quantization
- runtime
- tests
- attention
- bindings
- functional
- llmapi
- _perf_evaluator
- apps
- model
- quantization
- tools/plugin_gen
- utils
- windows
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
267 files changed
+4505
-4898
lines changedLines changed: 1 addition & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
13 | 13 |
| |
14 | 14 |
| |
15 | 15 |
| |
16 |
| - | |
| 16 | + | |
17 | 17 |
| |
18 | 18 |
| |
19 | 19 |
| |
|
File renamed without changes.
Lines changed: 2 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
9 | 9 |
| |
10 | 10 |
| |
11 | 11 |
| |
12 |
| - | |
| 12 | + | |
| 13 | + | |
13 | 14 |
|
Lines changed: 2 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
7 | 7 |
| |
8 | 8 |
| |
9 | 9 |
| |
10 |
| - | |
11 |
| - | |
| 10 | + | |
| 11 | + | |
12 | 12 |
| |
13 | 13 |
| |
14 | 14 |
| |
|
Lines changed: 26 additions & 6 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
8 | 8 |
| |
9 | 9 |
| |
10 | 10 |
| |
11 |
| - | |
| 11 | + | |
12 | 12 |
| |
13 | 13 |
| |
14 | 14 |
| |
| |||
17 | 17 |
| |
18 | 18 |
| |
19 | 19 |
| |
20 |
| - | |
21 |
| - | |
| 20 | + | |
| 21 | + | |
22 | 22 |
| |
23 |
| - | |
| 23 | + | |
24 | 24 |
| |
25 | 25 |
| |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
26 | 45 |
| |
27 | 46 |
| |
28 | 47 |
| |
| |||
46 | 65 |
| |
47 | 66 |
| |
48 | 67 |
| |
| 68 | + | |
| 69 | + | |
| 70 | + | |
49 | 71 |
| |
50 | 72 |
| |
51 | 73 |
| |
| |||
61 | 83 |
| |
62 | 84 |
| |
63 | 85 |
| |
64 |
| - | |
65 |
| - | |
66 | 86 |
| |
67 | 87 |
| |
68 | 88 |
| |
|
0 commit comments