{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":612354784,"defaultBranch":"master","name":"llama.cpp","ownerLogin":"ggerganov","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2023-03-10T18:58:00.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/1991296?v=4","public":true,"private":false,"isOrgOwned":false},"refInfo":{"name":"","listCacheKey":"v0:1718285924.0","currentOid":""},"activityList":{"items":[{"before":"a55eb1bf0fa2fd84147bdfd384391e029d988253","after":"172c8256840ffd882ab9992ecedbb587d9b21f15","ref":"refs/heads/master","pushedAt":"2024-06-13T12:18:45.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"rgerganov","name":"Radoslav Gerganov","path":"/rgerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/271616?s=80&v=4"},"commit":{"message":"rpc : fix ggml_backend_rpc_supports_buft() (#7918)","shortMessageHtmlLink":"rpc : fix ggml_backend_rpc_supports_buft() (#7918)"}},{"before":null,"after":"18133cab40075e4cf8f953440a2d45fdfaf2a04e","ref":"refs/heads/codeplay/revert-host-alloc","pushedAt":"2024-06-13T11:09:23.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"joeatodd","name":"Joe Todd","path":"/joeatodd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5213866?s=80&v=4"},"commit":{"message":"Revert \"use the correct SYCL context for host USM allocations\"\n\nManually reverting:\nhttps://github.com/ggerganov/llama.cpp/pull/7858\n\nSigned-off-by: Joe Todd ","shortMessageHtmlLink":"Revert \"use the correct SYCL context for host USM allocations\""}},{"before":null,"after":"abd7c7b8c26c5b97ceed5e8460655fa5d6f379ed","ref":"refs/heads/codeplay/unify-rope-sycl","pushedAt":"2024-06-13T09:37:10.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"joeatodd","name":"Joe Todd","path":"/joeatodd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5213866?s=80&v=4"},"commit":{"message":"Formatting\n\nSigned-off-by: Joe Todd ","shortMessageHtmlLink":"Formatting"}},{"before":"f578b86b2123d0f92afbaa98a031df4d4464e582","after":"a55eb1bf0fa2fd84147bdfd384391e029d988253","ref":"refs/heads/master","pushedAt":"2024-06-13T07:42:42.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"Galunid","name":null,"path":"/Galunid","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/10298730?s=80&v=4"},"commit":{"message":"readme : Remove outdated instructions from README.md (#7914) [no ci]","shortMessageHtmlLink":"readme : Remove outdated instructions from README.md (#7914) [no ci]"}},{"before":"f32f17a781e908df90e184df08183a936139fcbf","after":"d342abca575309263f21069a1fcd11ffbdc86055","ref":"refs/heads/sycl-remove-global-variables","pushedAt":"2024-06-13T03:49:46.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"airMeng","name":"Meng, Hengyu","path":"/airMeng","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/39229107?s=80&v=4"},"commit":{"message":"trim white space","shortMessageHtmlLink":"trim white space"}},{"before":"211fb045f1c9cd4c949389817624ec510c0fcefd","after":null,"ref":"refs/heads/sl/blas-backend","pushedAt":"2024-06-13T01:11:37.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"}},{"before":"1c641e6aac5c18b964e7b32d9dbbb4bf5301d0d7","after":"f578b86b2123d0f92afbaa98a031df4d4464e582","ref":"refs/heads/master","pushedAt":"2024-06-13T01:11:35.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"},"commit":{"message":"move BLAS to a separate backend (#6210)\n\n* move BLAS to a separate backend\r\n\r\n* rename GGML_USE_OPENBLAS to GGML_USE_BLAS\r\n\r\n* alloc : reuse same buffer when the same buffer type if used multiple times\r\n\r\n* set number of threads automatically for openblas and blis\r\n\r\n* sched : print assignments when GGML_SCHED_DEBUG env variable is set\r\n\r\n* sched : allow ops with weights on an incompatible buffer type\r\n\r\nThis will cause the weight to be copied to a backend that supports the\r\nop, which is very costly. The weight should have been stored in a buffer\r\nof a backend that can run the op, but llama.cpp cannot do this\r\nautomatically at the moment.\r\n\r\n---------\r\n\r\nCo-authored-by: Georgi Gerganov ","shortMessageHtmlLink":"move BLAS to a separate backend (#6210)"}},{"before":"ae9cd856980696e26e1d7f8df3737572ea304927","after":"211fb045f1c9cd4c949389817624ec510c0fcefd","ref":"refs/heads/sl/blas-backend","pushedAt":"2024-06-13T00:44:05.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"},"commit":{"message":"sched : allow ops with weights on an incompatible buffer type\n\nThis will cause the weight to be copied to a backend that supports the\nop, which is very costly. The weight should have been stored in a buffer\nof a backend that can run the op, but llama.cpp cannot do this\nautomatically at the moment.\n\nggml-ci","shortMessageHtmlLink":"sched : allow ops with weights on an incompatible buffer type"}},{"before":"a54b791211823f0c0cbf74aa317c09e501440967","after":"ae9cd856980696e26e1d7f8df3737572ea304927","ref":"refs/heads/sl/blas-backend","pushedAt":"2024-06-13T00:19:20.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"},"commit":{"message":"fix metal being used in layers not offloaded","shortMessageHtmlLink":"fix metal being used in layers not offloaded"}},{"before":"963552903f51043ee947a8deeaaa7ec00bc3f1a4","after":"1c641e6aac5c18b964e7b32d9dbbb4bf5301d0d7","ref":"refs/heads/master","pushedAt":"2024-06-12T23:41:53.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"ochafik","name":"Olivier Chafik","path":"/ochafik","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/273860?s=80&v=4"},"commit":{"message":"`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)\n\n* `main`/`server`: rename to `llama` / `llama-server` for consistency w/ homebrew\r\n\r\n* server: update refs -> llama-server\r\n\r\ngitignore llama-server\r\n\r\n* server: simplify nix package\r\n\r\n* main: update refs -> llama\r\n\r\nfix examples/main ref\r\n\r\n* main/server: fix targets\r\n\r\n* update more names\r\n\r\n* Update build.yml\r\n\r\n* rm accidentally checked in bins\r\n\r\n* update straggling refs\r\n\r\n* Update .gitignore\r\n\r\n* Update server-llm.sh\r\n\r\n* main: target name -> llama-cli\r\n\r\n* Prefix all example bins w/ llama-\r\n\r\n* fix main refs\r\n\r\n* rename {main->llama}-cmake-pkg binary\r\n\r\n* prefix more cmake targets w/ llama-\r\n\r\n* add/fix gbnf-validator subfolder to cmake\r\n\r\n* sort cmake example subdirs\r\n\r\n* rm bin files\r\n\r\n* fix llama-lookup-* Makefile rules\r\n\r\n* gitignore /llama-*\r\n\r\n* rename Dockerfiles\r\n\r\n* rename llama|main -> llama-cli; consistent RPM bin prefixes\r\n\r\n* fix some missing -cli suffixes\r\n\r\n* rename dockerfile w/ llama-cli\r\n\r\n* rename(make): llama-baby-llama\r\n\r\n* update dockerfile refs\r\n\r\n* more llama-cli(.exe)\r\n\r\n* fix test-eval-callback\r\n\r\n* rename: llama-cli-cmake-pkg(.exe)\r\n\r\n* address gbnf-validator unused fread warning (switched to C++ / ifstream)\r\n\r\n* add two missing llama- prefixes\r\n\r\n* Updating docs for eval-callback binary to use new `llama-` prefix.\r\n\r\n* Updating a few lingering doc references for rename of main to llama-cli\r\n\r\n* Updating `run-with-preset.py` to use new binary names.\r\nUpdating docs around `perplexity` binary rename.\r\n\r\n* Updating documentation references for lookup-merge and export-lora\r\n\r\n* Updating two small `main` references missed earlier in the finetune docs.\r\n\r\n* Update apps.nix\r\n\r\n* update grammar/README.md w/ new llama-* names\r\n\r\n* update llama-rpc-server bin name + doc\r\n\r\n* Revert \"update llama-rpc-server bin name + doc\"\r\n\r\nThis reverts commit e474ef1df481fd8936cd7d098e3065d7de378930.\r\n\r\n* add hot topic notice to README.md\r\n\r\n* Update README.md\r\n\r\n* Update README.md\r\n\r\n* rename gguf-split & quantize bins refs in **/tests.sh\r\n\r\n---------\r\n\r\nCo-authored-by: HanClinto ","shortMessageHtmlLink":"build: rename main → llama-cli, server → llama-server, llava-cli → …"}},{"before":"fee3c1d740c0e027c81e2f2f3fb48d619857175f","after":"33425a7e1ed366082a2dbf64f2485531471515e0","ref":"refs/heads/compilade/refactor-kv-cache","pushedAt":"2024-06-12T17:14:12.000Z","pushType":"push","commitsCount":83,"pusher":{"login":"compilade","name":null,"path":"/compilade","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/113953597?s=80&v=4"},"commit":{"message":"mamba : fix non-contiguous usage of ggml_silu","shortMessageHtmlLink":"mamba : fix non-contiguous usage of ggml_silu"}},{"before":"a9cae48003dfc4fe95b8f5c81682fc6e63425235","after":"963552903f51043ee947a8deeaaa7ec00bc3f1a4","ref":"refs/heads/master","pushedAt":"2024-06-12T15:41:51.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"JohannesGaessler","name":"Johannes Gäßler","path":"/JohannesGaessler","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/18492268?s=80&v=4"},"commit":{"message":"CUDA: fix broken oob check for FA vec f32 kernel (#7904)","shortMessageHtmlLink":"CUDA: fix broken oob check for FA vec f32 kernel (#7904)"}},{"before":null,"after":"a9cae48003dfc4fe95b8f5c81682fc6e63425235","ref":"refs/heads/codeplay/sycl-main","pushedAt":"2024-06-12T15:37:40.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"AidanBeltonS","name":null,"path":"/AidanBeltonS","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/87009434?s=80&v=4"},"commit":{"message":"tests : add non-cont unary tests (#7857)\n\n* tests : add non-cont unary tests\r\n\r\n* ggml : update unary asserts and \"supports_op\"\r\n\r\nggml-ci","shortMessageHtmlLink":"tests : add non-cont unary tests (#7857)"}},{"before":"a9cae48003dfc4fe95b8f5c81682fc6e63425235","after":"46325233c9167c48ae36e17aed732fb57e01edda","ref":"refs/heads/revert-7777-host-usm-context-fix","pushedAt":"2024-06-12T15:23:22.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"AidanBeltonS","name":null,"path":"/AidanBeltonS","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/87009434?s=80&v=4"},"commit":{"message":"Revert 7777","shortMessageHtmlLink":"Revert 7777"}},{"before":"4e4ff76a0f0365bb4a2a210953ef6b1a042b8dde","after":"a9cae48003dfc4fe95b8f5c81682fc6e63425235","ref":"refs/heads/revert-7777-host-usm-context-fix","pushedAt":"2024-06-12T15:07:59.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"AidanBeltonS","name":null,"path":"/AidanBeltonS","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/87009434?s=80&v=4"},"commit":{"message":"tests : add non-cont unary tests (#7857)\n\n* tests : add non-cont unary tests\r\n\r\n* ggml : update unary asserts and \"supports_op\"\r\n\r\nggml-ci","shortMessageHtmlLink":"tests : add non-cont unary tests (#7857)"}},{"before":"bfaa676b0841617d4ef3596e63aca6be1a8eb1b5","after":"a9cae48003dfc4fe95b8f5c81682fc6e63425235","ref":"refs/heads/master","pushedAt":"2024-06-12T13:00:22.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"tests : add non-cont unary tests (#7857)\n\n* tests : add non-cont unary tests\r\n\r\n* ggml : update unary asserts and \"supports_op\"\r\n\r\nggml-ci","shortMessageHtmlLink":"tests : add non-cont unary tests (#7857)"}},{"before":"b64daedca3619fdebac18e9bbd1ff6dd2fe275fb","after":"8412561c4b39a540e53e7fdde0078e5ab3adb908","ref":"refs/heads/gg/unary-non-cont","pushedAt":"2024-06-12T12:25:20.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"ggml : update unary asserts and \"supports_op\"\n\nggml-ci","shortMessageHtmlLink":"ggml : update unary asserts and \"supports_op\""}},{"before":"704a35b183748954013bd875bbbfdd9eaca14e62","after":"bfaa676b0841617d4ef3596e63aca6be1a8eb1b5","ref":"refs/heads/master","pushedAt":"2024-06-12T12:24:20.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"ggml : improve ggml_is_contiguous logic (#7856)\n\n* ggml : improve ggml_is_contiguous logic\r\n\r\nggml-ci\r\n\r\n* ggml : support more contiguous cases\r\n\r\nggml-ci","shortMessageHtmlLink":"ggml : improve ggml_is_contiguous logic (#7856)"}},{"before":"f2a029bd9d8bde1aaef4367c6ffa9e8ad48bdcbd","after":"cd026b48ef2cb5e5f12d2cd506ccac090cf0b729","ref":"refs/heads/gg/ggml-cont","pushedAt":"2024-06-12T12:13:11.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"ggml : support more contiguous cases\n\nggml-ci","shortMessageHtmlLink":"ggml : support more contiguous cases"}},{"before":"ca581c7a5069d7f2ea4daf3d3ff492747e599f66","after":null,"ref":"refs/heads/gg/servre-fix-array-prompt","pushedAt":"2024-06-12T11:42:31.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"}},{"before":"dcf752707d96eb305f546526c7bc5d01f0831130","after":"704a35b183748954013bd875bbbfdd9eaca14e62","ref":"refs/heads/master","pushedAt":"2024-06-12T11:42:29.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"server : restore numeric prompts (#7883)","shortMessageHtmlLink":"server : restore numeric prompts (#7883)"}},{"before":"2ad8c4983084aecf4092367281e07cd94f4a62af","after":null,"ref":"refs/heads/revert-7630-Fix-intel-docker","pushedAt":"2024-06-12T09:05:38.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"}},{"before":"f2b5764beb35583295e2475479c18f249b139b58","after":"dcf752707d96eb305f546526c7bc5d01f0831130","ref":"refs/heads/master","pushedAt":"2024-06-12T09:05:35.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"},"commit":{"message":"update intel docker oneapi-basekit to 2024.1.1-devel-ubuntu22.04 (#7894)\n\nIn addition this reverts a workaround we had to do to workaround the upstream issue with expired intel GPG package keys in 2024.0.1-devel-ubuntu22.04","shortMessageHtmlLink":"update intel docker oneapi-basekit to 2024.1.1-devel-ubuntu22.04 (#7894)"}},{"before":"e06659811e118462b22ed5c16ed3106544d522d9","after":"a54b791211823f0c0cbf74aa317c09e501440967","ref":"refs/heads/sl/blas-backend","pushedAt":"2024-06-12T08:32:20.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"},"commit":{"message":"Apply suggestions from code review\n\nCo-authored-by: Georgi Gerganov ","shortMessageHtmlLink":"Apply suggestions from code review"}},{"before":"f2b5764beb35583295e2475479c18f249b139b58","after":"2ad8c4983084aecf4092367281e07cd94f4a62af","ref":"refs/heads/revert-7630-Fix-intel-docker","pushedAt":"2024-06-12T07:52:36.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"airMeng","name":"Meng, Hengyu","path":"/airMeng","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/39229107?s=80&v=4"},"commit":{"message":"update intel docker","shortMessageHtmlLink":"update intel docker"}},{"before":"c21d6035d961d43426d1414a14a40ea77d64968e","after":"f2b5764beb35583295e2475479c18f249b139b58","ref":"refs/heads/revert-7630-Fix-intel-docker","pushedAt":"2024-06-12T07:51:59.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"airMeng","name":"Meng, Hengyu","path":"/airMeng","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/39229107?s=80&v=4"},"commit":{"message":"Fix a typo and add Fedora 40 pacakge to install for Vulkan (#7794) [no ci]\n\nFix \"appropiate\" to \"appropriate\" and add Fedora 40 packages to install to compile with Vulkan support","shortMessageHtmlLink":"Fix a typo and add Fedora 40 pacakge to install for Vulkan (#7794) [n…"}},{"before":"46b6d3132434f61641c0f7837744837a9993d894","after":null,"ref":"refs/heads/sl/fix-vk-view-extra","pushedAt":"2024-06-12T07:38:19.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"}},{"before":null,"after":"c21d6035d961d43426d1414a14a40ea77d64968e","ref":"refs/heads/revert-7630-Fix-intel-docker","pushedAt":"2024-06-12T07:33:42.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"airMeng","name":"Meng, Hengyu","path":"/airMeng","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/39229107?s=80&v=4"},"commit":{"message":"Revert \"[SYCL] fix intel docker (#7630)\"\n\nThis reverts commit 3854c9d07f67de7f8cd6d86117bfaef47549b05a.","shortMessageHtmlLink":"Revert \"[SYCL] fix intel docker (#7630)\""}},{"before":"73bac2b11d7d3e20982fc9ee607625836387db8b","after":"f2b5764beb35583295e2475479c18f249b139b58","ref":"refs/heads/master","pushedAt":"2024-06-12T01:18:16.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"},"commit":{"message":"Fix a typo and add Fedora 40 pacakge to install for Vulkan (#7794) [no ci]\n\nFix \"appropiate\" to \"appropriate\" and add Fedora 40 packages to install to compile with Vulkan support","shortMessageHtmlLink":"Fix a typo and add Fedora 40 pacakge to install for Vulkan (#7794) [n…"}},{"before":"ecb75b5f54cab6ca7f77ec51eb5f7d87c87be6cd","after":"e06659811e118462b22ed5c16ed3106544d522d9","ref":"refs/heads/sl/blas-backend","pushedAt":"2024-06-11T21:35:45.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"},"commit":{"message":"fixes","shortMessageHtmlLink":"fixes"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEZGFQkgA","startCursor":null,"endCursor":null}},"title":"Activity · ggerganov/llama.cpp"}