Skip to content

Commit

Permalink
Moved sharktank runner to ossci cluster (#990)
Browse files Browse the repository at this point in the history
Progress on #793

Moved sharktank runner to ossci cluster

---------

Signed-off-by: Elias Joseph <[email protected]>
Co-authored-by: Elias Joseph <[email protected]>
Co-authored-by: saienduri <[email protected]>
  • Loading branch information
3 people authored Mar 3, 2025
1 parent da96b47 commit 868afc3
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 5 deletions.
7 changes: 4 additions & 3 deletions .github/workflows/ci-sharktank.yml
Original file line number Diff line number Diff line change
Expand Up @@ -93,15 +93,16 @@ jobs:
strategy:
matrix:
python-version: [3.11]
runs-on: [llama-mi300x-3]
runs-on: [linux-mi300-1gpu-ossci]
fail-fast: false
runs-on: ${{matrix.runs-on}}
defaults:
run:
shell: bash
env:
VENV_DIR: ${{ github.workspace }}/.venv
HF_HOME: "/data/huggingface"
HF_HOME: "/shark-cache/data/huggingface"
HF_TOKEN: ${{ secrets.HF_FLUX_TOKEN }}
steps:
- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2

Expand Down Expand Up @@ -193,7 +194,7 @@ jobs:
run: |
pytest -v sharktank/ -m punet_quick \
--durations=0 \
--timeout=600
--timeout=900
# Depends on other jobs to provide an aggregate job status.
# TODO(#584): move test_with_data and test_integration to a pkgci integration test workflow?
Expand Down
4 changes: 2 additions & 2 deletions sharktank/tests/models/llama/quark_parity_test.py
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@
class QuarkParityTest(TempDirTestBase):
def setUp(self):
super().setUp()
self.path_prefix = Path("/shark-dev/quark_test")
self.path_prefix = Path("/shark-cache/quark_test")

@with_quark_data
def test_compare_against_quark(self):
Expand Down Expand Up @@ -55,7 +55,7 @@ def test_compare_against_quark(self):
"sharktank.examples.paged_llm_v1",
"The capitol of Texas is",
f"--irpa-file={self.path_prefix}/fp8_bf16_weight.irpa",
f"--tokenizer-config-json=/data/llama3.1/8b/tokenizer.json",
f"--tokenizer-config-json=/shark-dev/data/llama3.1/8b/tokenizer.json",
"--fake-quant",
"--attention-kernel=torch",
"--activation-dtype=bfloat16",
Expand Down

0 comments on commit 868afc3

Please sign in to comment.