Update server instructions for web front end (#2103)

Co-authored-by: Jesse Johnson <thatguy@jessejojojohnson.com>
2025-02-21 15:30:00 +00:00 · 2023-07-05 15:13:35 +00:00 · 2023-07-05 15:13:35 +00:00 · 8567c76b53
commit 8567c76b53
parent 924dd22fd3
1 changed files with 3 additions and 2 deletions
--- a/examples/server/README.md
+++ b/examples/server/README.md
@ -1,6 +1,6 @@
 # llama.cpp/example/server

-This example demonstrates a simple HTTP API server to interact with llama.cpp.
+This example demonstrates a simple HTTP API server and a simple web front end to interact with llama.cpp.

 Command line options:

@ -21,6 +21,7 @@ Command line options:
 -   `-to N`, `--timeout N`: Server read/write timeout in seconds. Default `600`.
 -   `--host`: Set the hostname or ip address to listen. Default `127.0.0.1`.
 -   `--port`: Set the port to listen. Default: `8080`.
+-   `--public`: path from which to serve static files (default examples/server/public)
 -   `--embedding`: Enable embedding extraction, Default: disabled.

 ## Build
@ -59,7 +60,7 @@ server.exe -m models\7B\ggml-model.bin -c 2048
 ```

 The above command will start a server that by default listens on `127.0.0.1:8080`.
-You can consume the endpoints with Postman or NodeJS with axios library.
+You can consume the endpoints with Postman or NodeJS with axios library. You can visit the web front end at the same url.

 ## Testing with CURL