forked from ggml-org/llama.cpp
Feature/ocaml coq #8
Open
jmikedupont2 wants to merge 135 commits into master from feature/ocaml-coq
Conversation
* typos
* Update examples/parallel/README.md
Co-authored-by: Kerfuffle <[email protected]>
* gguf-py: gguf_writer: Use BytesIO to build metadata
* Use bytearray instead
* Bump gguf-py package version
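The change above lives in the Python gguf-py package (metadata built in a BytesIO, then a plain bytearray, before writing). What follows is only a conceptual C++ analog of the same buffering idea, not the actual gguf-py code: accumulate the metadata blob in one in-memory buffer and flush it with a single write.

```cpp
// Conceptual C++ analog of gguf-py's bytearray buffering; illustrative only.
#include <cstdio>
#include <cstdint>
#include <vector>

static void write_metadata(FILE * f, const std::vector<std::vector<uint8_t>> & fields) {
    std::vector<uint8_t> buf;                 // grows like a Python bytearray
    for (const auto & field : fields) {
        buf.insert(buf.end(), field.begin(), field.end());
    }
    fwrite(buf.data(), 1, buf.size(), f);     // single write at the end
}
```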
(ggml-org#4041)
* Add ReLU and SQR CUDA ops to fix Persimmon offloading
* Persimmon loader: more helpful error on CUDA/ROCm when offloading too many layers
* sync : ggml (backend v2) (wip)
* sync : migrate examples and llama.cpp to dynamic graphs (wip)
* sync : update tests + fix max op params to 64 (ggml-ci)
* sync : ggml-cuda (ggml-ci)
* llama : fix save/load state context size (ggml-ci)
* sync : try to fix build on tvOS
* sync : pass custom graph sizes in training examples
* sync : update graph copies to new ggml API
* sync : update sync-ggml.sh with new files
* scripts : fix header in sync script
* train : fix context size calculations
* llama : increase inference graph size up to 4096 nodes
* train : allocate grads for backward graphs
* train : allocate grads for gb_tmp
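The "dynamic graphs" entries above refer to compute graphs whose node capacity is chosen at creation time instead of being fixed at compile time. A minimal sketch, assuming the ggml_new_graph_custom API that the backend v2 sync introduces (a node capacity plus a flag reserving gradient storage); the 4096 figure mirrors the inference graph size mentioned above.

```cpp
// Minimal sketch, assuming ggml's post-"backend v2" graph API; the
// default-capacity variant is ggml_new_graph. Numbers are illustrative.
#include "ggml.h"

static struct ggml_cgraph * build_train_graph(struct ggml_context * ctx) {
    // training graphs additionally need storage for gradients,
    // hence grads = true ("train : allocate grads for backward graphs")
    const size_t n_nodes = 4096;
    return ggml_new_graph_custom(ctx, n_nodes, /*grads=*/true);
}
```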
* add safetensors to convert.py help message
* Check for single-file safetensors model
* Update convert.py "model" option help message
* revert convert.py help message change
* Add support for stablelm-3b-4e1t
* Supports GPU offloading of (n-1) layers
Co-authored-by: Jared Van Bortel <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: Bernhard Gstrein <[email protected]>
(ggml-org#4040)
* gguf-py: gguf-dump: Respect --no-tensor flag in JSON mode
* Respect add_bos_token GGUF metadata value
* gguf-py: Try to fix SpecialVocab giving up too easily for the Nth time
* llama : fix data units (ggml-ci)
* Revert "llama : fix data units" (reverts commit f5feac8)
* llama : disambiguate data units (ggml-ci)
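The disambiguation entry above concerns size printouts. A hedged sketch of the idea, with format_size_mib as a hypothetical helper rather than llama.cpp's actual code: report explicit binary units (MiB = 2^20 bytes) instead of an ambiguous "MB".

```cpp
// Hedged sketch: explicit binary units make the scaling unambiguous.
// format_size_mib is a hypothetical helper, not llama.cpp's API.
#include <cstdio>
#include <cstdint>
#include <string>

static std::string format_size_mib(uint64_t n_bytes) {
    char buf[64];
    snprintf(buf, sizeof(buf), "%.2f MiB", n_bytes / (1024.0 * 1024.0));
    return buf;
}

// e.g. format_size_mib(7000000) -> "6.68 MiB": the reader can see the
// value is binary-scaled, not the decimal megabytes "MB" might suggest
```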
* Fix ggml-org#4017
* Update ggml-cuda.cu
Co-authored-by: Jared Van Bortel <[email protected]>
* finetune : zero the loraB initial vectors
Without this, the first iteration starts out far from the base model instead of exactly on it. Zeroing loraB is what the paper recommends; loralib also zeroes at least one of the init vector pairs (though in some cases it departs from the paper by using a different distribution for the other vector).
* tabs to spaces
* Use ggml_set_zero instead of adding a new function
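Why zeroing loraB pins the first iteration to the base model, in the notation of the LoRA paper (the scaling factor α/r is the paper's convention and an assumption here, not something stated in the commit):

```latex
% LoRA applies a low-rank update to a frozen weight matrix W:
%   B \in \mathbb{R}^{d \times r}, \quad A \in \mathbb{R}^{r \times k}
% Initializing B = 0 gives BA = 0, so W' = W and training starts
% exactly at the base model, as the commit message argues.
W' = W + \frac{\alpha}{r} B A, \qquad B_0 = 0 \;\Rightarrow\; W'_0 = W
```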
(ggml-org#4079)
* Remove logically superfluous assertions and order by dimension
* Use cblas_sgemm() to implement ggml_compute_forward_out_prod()
* Remove ggml_compute_forward_out_prod_use_blas(), fix compile errors on cmake/zig, remove trailing whitespace
* Add OpenBLAS support for sgemm() in compute_forward_out_prod()
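The sgemm entry above rests on the observation that an outer-product accumulation over a batch is itself a matrix product, so it can be handed to BLAS. A sketch under the assumption that out_prod reduces to C += A·Bᵀ with row-major buffers; the shapes are illustrative, not ggml's actual tensor layout.

```cpp
// Sketch of the idea behind the change: a batched outer-product
// accumulation is a single matrix product C += A * B^T, which one
// BLAS sgemm call computes far faster than nested scalar loops.
#include <cblas.h>

// C (m x n) += A (m x k) * B^T, where B is stored (n x k), all row-major
static void out_prod_blas(int m, int n, int k,
                          const float * A, const float * B, float * C) {
    cblas_sgemm(CblasRowMajor, CblasNoTrans, CblasTrans,
                m, n, k,
                1.0f, A, k,
                      B, k,
                1.0f, C, n);   // beta = 1.0f accumulates into C
}
```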
* llama : add functions to get the model's metadata
* format -> std::to_string
* better documentation
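A hypothetical sketch of what "functions to get the model's metadata" can look like, with numeric values stringified via std::to_string as the "format -> std::to_string" entry suggests; the names here are illustrative, not the API this commit actually adds.

```cpp
// Illustrative key/value metadata store; not llama.cpp's actual API.
#include <cstdint>
#include <map>
#include <string>

struct model_meta {
    std::map<std::string, std::string> kv;

    void set_u32(const std::string & key, uint32_t v) {
        kv[key] = std::to_string(v);   // format -> std::to_string
    }
    // returns an empty string when the key is absent
    std::string get(const std::string & key) const {
        auto it = kv.find(key);
        return it == kv.end() ? "" : it->second;
    }
};
```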
(ggml-org#4074)
- introduces a help entry for the argument
- cuts the '--gpu-layers' form in order to simplify usage and documentation
Signed-off-by: Jiri Podivin <[email protected]>
Co-authored-by: Jiri Podivin <[email protected]>
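A hedged sketch of the alias trimming described above: keep the short -ngl and long --n-gpu-layers spellings and stop accepting the extra --gpu-layers form. The parsing-loop shape is illustrative, not llama.cpp's actual argument parser.

```cpp
// Illustrative argument loop; llama.cpp's real parser lives elsewhere.
#include <cstdlib>
#include <cstring>

static void parse_args(int argc, char ** argv, int & n_gpu_layers) {
    for (int i = 1; i < argc; i++) {
        if (!strcmp(argv[i], "-ngl") || !strcmp(argv[i], "--n-gpu-layers")) {
            if (i + 1 < argc) {
                n_gpu_layers = atoi(argv[++i]);
            }
        }
        // "--gpu-layers" intentionally no longer accepted
    }
}
```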
* logging: improve escaping in yaml output
* logging: include review feedback
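A minimal sketch of the escaping concern behind the logging entry above: a string emitted inside double quotes must have backslashes, quotes, and newlines escaped, or the output stops being valid YAML. Illustrative only; not the PR's actual escaping table.

```cpp
// Escape a string for emission inside double quotes in a YAML document.
#include <string>

static std::string yaml_escape(const std::string & s) {
    std::string out;
    for (char c : s) {
        switch (c) {
            case '\\': out += "\\\\"; break;  // backslash first
            case '"':  out += "\\\""; break;  // closing-quote injection
            case '\n': out += "\\n";  break;  // literal newline
            default:   out += c;      break;
        }
    }
    return out;
}
```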
Falcon HF compatibility
…amas to load (ggml-org#4089)
Co-authored-by: Don Mahurin <@>
* build: support ppc64le build for make and CMake
* build: keep __POWER9_VECTOR__ ifdef and extend with __powerpc64__
Co-authored-by: Georgi Gerganov <[email protected]>
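A sketch of the preprocessor guard the second entry describes, assuming the usual pattern of extending the existing condition rather than replacing it:

```cpp
// Keep the existing __POWER9_VECTOR__ branch and widen the condition so
// plain ppc64le builds (which may not define it) take the POWER path too.
#if defined(__POWER9_VECTOR__) || defined(__powerpc64__)
    // POWER-specific code path (e.g. VSX vector intrinsics)
#else
    // generic fallback
#endif
```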
(ggml-org#4124)
* ggml-cuda.cu: Clean up warnings when compiling with clang
* ggml-cuda.cu: Move static items into anonymous namespace
* ggml-cuda.cu: Fix use of namespace start macro
* Revert "ggml-cuda.cu: Fix use of namespace start macro" (reverts commit 26c1149)
* Revert "ggml-cuda.cu: Move static items into anonymous namespace" (reverts commit e29757e)
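A sketch of the clang-warning cleanup this commit attempts (and then partly reverts): definitions private to one translation unit can get internal linkage from an anonymous namespace instead of the static keyword. Names below are illustrative.

```cpp
// File-local definitions with internal linkage, without 'static':
namespace {

constexpr int kWarpSize = 32;      // illustrative file-local constant

int round_up(int x, int m) {       // visible only in this translation unit
    return (x + m - 1) / m * m;
}

} // namespace
```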
jmikedupont2 pushed a commit that referenced this pull request on Dec 18, 2023
No description provided.