Search code, repositories, users, issues, pull requests...

android : use "ci-android" branch for CI

Generally require more time to grok but manageable by beginner to medium expertise level

#7342 opened May 17, 2024 by besnardjb

Loading…

android

Issues specific to Android

devops

improvements to build systems and github actions

github-actions-labeler: initial commit devops

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

#7341 opened May 17, 2024 by ggerganov

Loading…

improvements to build systems and github actions

Generally require more time to grok but manageable by beginner to medium expertise level

#7330 opened May 16, 2024 by mofosyne

Loading…

add Viking tokenizer support model

Model specific

python

python script changes

Viking-7B tokenizer support

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

#7329 opened May 16, 2024 by jonabur

Loading…

model

Model specific

python

python script changes

Fixed painfully slow single process builds.

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

#7328 opened May 16, 2024 by akx • Draft

build

Compilation issues

need feedback

Testing and feedback with results are needed

performance

Speed related topics

#7326 opened May 16, 2024 by jboero

Loading…

Add support for larger Granite Code Models (20B, 34B)

model

Model specific

[SYCL] Update SYCL upscale operation

Generally require more time to grok but manageable by beginner to medium expertise level

#7324 opened May 16, 2024 by sroecker

Loading…

generation quality

Quality of model output

sched : support async weight copy

Generally require more time to grok but manageable by beginner to medium expertise level

SYCL

https://en.wikipedia.org/wiki/SYCL - GPU programming language

#7321 opened May 16, 2024 by AidanBeltonS

Loading…

performance

Speed related topics

ggml : fix quants nans when all the group weights are very close to zero

Generally require more time to grok but manageable by beginner to medium expertise level

#7315 opened May 15, 2024 by slaren • Draft

bugfix

fixes an issue or bug

Generally require more time to grok but manageable by beginner to medium expertise level

#7313 opened May 15, 2024 by slaren

Loading…

Add phi-2 tokenizer model

Model specific

Capture CUDA logging output

Generally require more time to grok but manageable by beginner to medium expertise level

#7300 opened May 15, 2024 by BramVanroy

Loading…

enhancement

New feature or request

Nvidia GPU

Issues specific to Nvidia GPUs

avoid to get prompt in infill mode and embedding mode

Generally require more time to grok but manageable by beginner to medium expertise level

#7298 opened May 15, 2024 by fraxy-v

Loading…

examples review complexity : low

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

server

#7286 opened May 14, 2024 by woodx9 • Draft

common: free ctx_gguf when exiting llama_control_vector_load_one

bugfix

fixes an issue or bug

ggml-opencl, llama: using reserve() if count already known

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

#7285 opened May 14, 2024 by stevegrubb

Loading…

common, ngram_cache: added const reference for std::pair<> and std::tuple<> more 16 bytes:

Refactoring

review complexity : high

Generally require indepth knowledge of LLMs or GPUs

#7272 opened May 14, 2024 by GermanAizek • Draft

Refactoring

ggml, ngram-cache, log: added const and const ref for function params

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

#7270 opened May 14, 2024 by GermanAizek • Draft

Refactoring

ggml llama: align structs for memory optimization on 64-bit platforms

Generally require more time to grok but manageable by beginner to medium expertise level

#7269 opened May 14, 2024 by GermanAizek

Loading…

Refactoring