- Gen AI stuff
- A lightweight OpenAI-API-compatible server: the av_connect HTTP server in C++ (see the example request after this list)
- Text generation: llama.cpp
- Web UI: a simple web interface for exploring and experimenting (borrowed from the llama.cpp project)
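Because the server aims to be OpenAI-API compatible, it can be queried with a standard OpenAI-style request. The sketch below is only an illustration: it assumes the server listens on port 8080 (as in the Docker instructions later in this README) and exposes the usual /v1/chat/completions route; adjust the port, path, and payload to match your build.

```sh
# Minimal chat request, assuming an OpenAI-style /v1/chat/completions endpoint.
curl http://127.0.0.1:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "messages": [
          {"role": "user", "content": "Hello, what can you do?"}
        ]
      }'
```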
Obtain the latest container image from Docker Hub:

**Note: the Docker image is currently quite outdated.**

```sh
docker image pull harryavble/av_llm
```
Then access the web interface at http://127.0.0.1:8080.
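Once a container is running (see the docker run command further down), a quick way to confirm the server is reachable is to request the web UI root and check the HTTP status code:

```sh
# Expect a 200 status code if the server is up and serving the web UI.
curl -s -o /dev/null -w "%{http_code}\n" http://127.0.0.1:8080
```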
- LLaMA 1
- LLaMA 2
- LLaMA 3
- Mistral-7B
- Mixtral MoE
- DBRX
- Falcon
- Chinese-LLaMA-Alpaca

This application is built on top of llama.cpp, so it should work with any model that llama.cpp supports.
Run the container with your host model folder mounted:

```sh
docker run -p 8080:8080 -v $your_host_model_folder:/work/model av_llm ./av_llm -m /work/model/$your_model_file
```
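As a concrete illustration, assuming a GGUF file named llama-2-7b-chat.Q4_K_M.gguf has been downloaded into ~/models on the host (both names are placeholders, not files shipped with this project):

```sh
# Hypothetical invocation; substitute your own folder and model file names.
docker run -p 8080:8080 -v ~/models:/work/model av_llm ./av_llm -m /work/model/llama-2-7b-chat.Q4_K_M.gguf
```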
Or build and run from source:

```sh
$ cmake -B build && cmake --build build
$ build/av_llm -m <path to gguf file>
```
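If you do not yet have a GGUF file, one option is the Hugging Face CLI; the repository and file names below are only an example and are not endorsed or tested by this project:

```sh
# Download a quantized GGUF model (example repo/file names) and start the server with it.
pip install -U "huggingface_hub[cli]"
huggingface-cli download TheBloke/Llama-2-7B-Chat-GGUF llama-2-7b-chat.Q4_K_M.gguf --local-dir ./models
build/av_llm -m ./models/llama-2-7b-chat.Q4_K_M.gguf
```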
Once running, the web UI shown below should be available.
- Support more LLM models
- Support more of the OpenAI API
- Support more applications
This is a demonstration version; some error handling and validation are not yet complete.
Contact me at avble.harry dot gmail.com if you have any questions or issues.