llama.cpp

aditya/llama.cpp

mirror of https://git.adityakumar.xyz/llama.cpp.git synced 2025-02-22 07:40:00 +00:00

f4f5362edb

Update README.md (#444) Gary Mulder 2023-03-24 15:23:09 +0000
863f65e2e3

fix instruct mode (#445) rabidcopy 2023-03-24 10:22:39 -0500
afd220d9c6

Properly free llama_context on failure Georgi Gerganov 2023-03-24 17:21:01 +0200
481044d50c

additional optimizations for POWER9 (#454) Cameron Kaiser 2023-03-24 08:19:26 -0700
563cdc391d

Support calling mlock() on loaded model data on Linux and macOS (#453) comex 2023-03-24 08:19:05 -0700
8d4a855c24

Add embedding mode with arg flag. Currently working (#282) Luciano 2023-03-24 08:05:13 -0700
b6b268d441

Add link to Roadmap discussion Georgi Gerganov 2023-03-24 09:13:35 +0200
3cd8dde0d1

Revert "Fix memory allocation issues and seg faults" Georgi Gerganov 2023-03-24 06:22:28 +0200
4870e455b3

Fix memory allocation issues and seg faults Georgi Gerganov 2023-03-24 00:11:53 +0200
483bab2e3d

Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439) Georgi Gerganov 2023-03-23 23:22:01 +0200
404e1da38e

Fix quantize script not finding models in parent directory (#428) Jed Fox 2023-03-23 16:42:52 -0400
4cc053b6d5

Remove oboslete command from Docker script Georgi Gerganov 2023-03-23 22:39:44 +0200
0ba5a3a9a5

Obsolete Georgi Gerganov 2023-03-23 22:32:02 +0200
2e17dfd80a

Replace EOS with newline to prevent context/memory being flushed by EOS in interactive mode (#333) rabidcopy 2023-03-23 15:22:47 -0500
20a1a4e09c

Fix GPTQ converter (#423) Timmy Knight 2023-03-23 10:18:13 -1000
ad072fc5ad

Generate library with CMake (#430) nusu-github 2023-03-24 05:16:48 +0900
ea10d3ded2

Command line args bounds checking (#424) anzz1 2023-03-23 19:54:28 +0200
a18c19259a

Fix Nix build Ben Siraphob 2023-03-22 00:37:02 -0500
a50e39c6fe

Revert "Delete SHA256SUMS for now" (#429) Stephan Walter 2023-03-23 14:15:48 +0000
a140219e81

Fix Makefile echo escape codes (by removing them). (#418) Kerfuffle 2023-03-23 05:41:32 -0600
8a3e5ef801

Move model section from issue template to README.md (#421) Gary Mulder 2023-03-23 11:30:40 +0000
8eea5ae0e5

Delete SHA256SUMS for now (#416) anzz1 2023-03-23 12:26:19 +0200
93208cfb92

Adjust repetition penalty .. Georgi Gerganov 2023-03-23 10:46:58 +0200
03ace14cfd

Add link to recent podcast about whisper.cpp and llama.cpp Georgi Gerganov 2023-03-23 09:48:51 +0200
e4412b45e3

CI: CMake: Separate build and test steps (#376) anzz1 2023-03-23 04:20:34 +0200
f7dc43bc0d

Fix instruct mode broken by PR #354 (#409) tjohnman 2023-03-23 01:30:23 +0100
ee8a788786

Update issue template so people will use it (#404) Gary Mulder 2023-03-22 19:06:18 +0000
69c92298a9

Deduplicate q4 quantization functions (#383) Stephan Walter 2023-03-22 17:29:06 +0000
97940520e8

fix: add POSIX functionality for Linux compilation (#51) Valentyn Bezshapkin 2023-03-22 18:20:25 +0100
305ba6f0e6

Don't force immediate interactive without -i (#354) tjohnman 2023-03-22 18:16:35 +0100
4122dffff9

cmake: make llama an actual library (#392) Erik Scholz 2023-03-22 17:37:10 +0100
56e659a0b2

fix perplexity after c-api refactor (#390) Erik Scholz 2023-03-22 17:09:38 +0100
40ea807a97

Add details on perplexity to README.md (#395) Gary Linscott 2023-03-22 08:53:54 -0700
d5850c53ca

Add missing header for memcpy (#386) Yusuf Kağan Hanoğlu 2023-03-22 11:55:45 +0300
ae44e23ee3

When seed <= 0 - use the clock to generate one Georgi Gerganov 2023-03-22 07:47:15 +0200
928480ef5b

Init llama_context_params properly from CLI (#370) Georgi Gerganov 2023-03-22 07:45:00 +0200
56817b1f88

Remove temporary notice and update hot topics Georgi Gerganov 2023-03-22 07:34:02 +0200
f5a77a629b

Introduce C-style API (#370) Georgi Gerganov 2023-03-22 07:32:36 +0200
da0e9fe90c

Add SHA256SUMS file and instructions to README how to obtain and verify the downloads Gary Mulder 2023-03-20 20:14:06 +0000
e6c9e0986c

Fix bin dir for win ci anzz1 2023-03-21 23:49:24 +0200
01a297b099

specify build type for ctest on windows (#371) Erik Scholz 2023-03-21 22:34:25 +0100
3366853e41

Add notice about pending change Georgi Gerganov 2023-03-21 22:57:35 +0200
3f9c6135e4

fix typo in chatLLaMa (#368) Mathieu Nayrolles 2023-03-21 16:52:27 -0400
0f61352708

Update issue templates Georgi Gerganov 2023-03-21 19:47:27 +0200
353ec251a4

We could use std::unordered_map over std::map (#305) Fabio R. Sluzala 2023-03-21 14:21:50 -0300
89d5d90f3b

Fix color codes emitting mid-UTF8 code. (#312) Matvey Soloviev 2023-03-21 18:11:01 +0100
16ffc013c6

Importer for GPTQ quantized LLaMA models (#301) comex 2023-03-21 09:42:25 -0700
486ae645fd

Compute perplexity over prompt (#270) Gary Linscott 2023-03-21 09:27:42 -0700
3ab3e6582f

Add chatLLaMa script (#198) Jean-Christophe Hoelt 2023-03-21 18:23:15 +0200
f157088cb7

makefile: Fix CPU feature detection on Haiku (#218) Alex von Gluck IV 2023-03-21 11:21:06 -0500
c86ba036e6

Enable ANSI colors on Windows 10+ (#311) anzz1 2023-03-21 18:14:46 +0200
1daf4dd712

Minor style changes Georgi Gerganov 2023-03-21 18:10:32 +0200
dc6a845b85

Add chat.sh script Georgi Gerganov 2023-03-21 18:09:37 +0200
6a612959e1

Check for reverse prompt by characters instead of tokens (#292) (#330) tjohnman 2023-03-21 17:05:06 +0100
d5f56a5e5a

Check for reverse prompt by characters instead of tokens (#292) (#330) tjohnman 2023-03-21 17:04:43 +0100
3bfa3b43b7

Fix convert script, warnings alpaca instructions, default params Georgi Gerganov 2023-03-21 17:59:16 +0200
715d292ee0

Add OpenBSD support (#314) Kevin Lo 2023-03-21 09:50:09 -0600
c98ae02668

fix typo in comment (#318) Mack Straight 2023-03-21 08:49:43 -0700
c3b2306b18

Makefile: slightly cleanup for Mac Intel; echo instead of run ./main -h (#335) Qingyou Meng 2023-03-21 23:44:11 +0800
975d2cebf9

cmdline option for custom amount of model parts (--n_parts N) (#348) anzz1 2023-03-21 17:42:43 +0200
e0ffc861fa

Update IPFS links to quantized alpaca with new tokenizer format (#352) Kevin Kwok 2023-03-21 08:34:49 -0700
8f644a0a85

Change default repeat_penalty to 1.0 Georgi Gerganov 2023-03-21 17:32:14 +0200
eb34620aec

Add tokenizer test + revert to C++11 (#355) Georgi Gerganov 2023-03-21 17:29:41 +0200
2e664f1ff4

Add initial AVX512 support for dot product on Linux (#320) Casey Primozic 2023-03-21 07:35:42 -0700
8cf9f34edd

Adding missing features of CMakeLists.txt & Refactoring (#131) nusu-github 2023-03-21 09:37:16 +0900
bd4b46d6ba

Nix flake: set meta.mainProgram to llama Ben Siraphob 2023-03-20 16:44:30 -0500
6b6d5b5024

Fixed tokenizer.model not found error when model dir is symlink (#325) Qingyou Meng 2023-03-21 03:33:10 +0800
a791a68b61

move file magic/version to header, print expected version (#319) Mack Straight 2023-03-20 12:26:01 -0700
0f1b21cb90

Docker - Fix publish docker image in GitHub Registry (#235) Bernat Vadell 2023-03-20 18:05:20 +0100
074bea2eb1

sentencepiece bpe compatible tokenizer (#252) Mack Straight 2023-03-20 03:17:23 -0700
5cb63e2493

Add tqdm to Python requirements (#293) Stephan Walter 2023-03-20 08:24:11 +0000
da5303c1ea

bugfix: default should not be interactive (#304) cocktailpeanut 2023-03-19 17:44:20 -0400
4545539d71

Rename script Georgi Gerganov 2023-03-19 21:58:51 +0200
edeba28366

Add temporary helper script for Alpaca chat Georgi Gerganov 2023-03-19 21:57:28 +0200
5c19c70ba6

fix coloring of last n_batch of prompt, and refactor line input (#221) Rickey Bowers Jr 2023-03-19 13:44:30 -0600
24568371ae

Support for multiple reverse prompts. (#299) tjohnman 2023-03-19 20:33:06 +0100
7392f1cd2c

Improved quantize script (#222) Suaj Carrot 2023-03-19 12:38:44 -0600
ad5fd5b60c

Make prompt randomization optional. (#300) tjohnman 2023-03-19 19:36:19 +0100
368d0c8a9e

Respect the maximum number of tokens in interactive. (#298) tjohnman 2023-03-19 19:31:17 +0100
50fae10d03

Add --ignore-eos parameter (#181) slaren 2023-03-19 19:22:48 +0100
084e2f0ec0

interactive mode: print '\n' in sigint_handler, this flush stdout thus ensure color reset. (#283) Qingyou Meng 2023-03-20 02:10:00 +0800
0b366e7357

Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294) Erik Scholz 2023-03-19 18:57:00 +0100
160bfb217d

Update hot topics to mention Alpaca support Georgi Gerganov 2023-03-19 19:51:55 +0200
c494ed5b94

Fix off-by-one bug (#115) Georgi Gerganov 2023-03-19 19:46:32 +0200
c1c7026b47

Fix python stuff (#109) Georgi Gerganov 2023-03-19 19:33:18 +0200
467b149761

Refactoring convert-pth-to-ggml.py: more concise and readable (#109) qunash 2023-03-19 20:17:39 +0300
70f01cb863

Drop trailing new line from file prompts (#80) Georgi Gerganov 2023-03-19 19:04:44 +0200
a4e63b73df

Add instruction for using Alpaca (#240) Georgi Gerganov 2023-03-19 18:49:50 +0200
9e1707218a

Add "--instruct" argument for usage with Alpaca (#240) Georgi Gerganov 2023-03-19 18:37:02 +0200
22213a17b5

Change RMSNorm eps to 1e-6 (#173) Georgi Gerganov 2023-03-19 17:30:00 +0200
d7def1a752

Warn user if a context size greater than 2048 tokens is specified (#274) Ronsor 2023-03-18 17:10:47 -0700
6f61c18ec9

Fix typo in readme Pavol Rusnak 2023-03-18 22:39:46 +0100
1e5a6d088d

Add note about Python 3.11 to readme Pavol Rusnak 2023-03-18 22:20:04 +0100
554b541521

Add memory/disk requirements to readme Pavol Rusnak 2023-03-18 21:58:46 +0100
d3f202d57b

Remove unused code since n_vocab is model.hparams.n_vocab (#262) Alex Nguyen 2023-03-18 20:51:49 +0700
e03e359730

fixed warning with std::ignore about unused function result (#151) Justin Suess 2023-03-18 07:44:09 -0400
a81d0c2a17

Fix n^2 loop in tokenization (#254) Gary Linscott 2023-03-18 04:17:19 -0700
b2de7f18df

CI Improvements (#230) anzz1 2023-03-18 09:27:12 +0200
a292747893

Nix flake (#40) Niklas Korz 2023-03-17 23:03:48 +0100
c9f670a177

Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +0100
