How to use with llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf IkariDev/Athena-v4-GGUF:
# Run inference directly in the terminal:
llama-cli -hf IkariDev/Athena-v4-GGUF:
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf IkariDev/Athena-v4-GGUF:
# Run inference directly in the terminal:
llama-cli -hf IkariDev/Athena-v4-GGUF:
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf IkariDev/Athena-v4-GGUF:
# Run inference directly in the terminal:
./llama-cli -hf IkariDev/Athena-v4-GGUF:
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf IkariDev/Athena-v4-GGUF:
# Run inference directly in the terminal:
./build/bin/llama-cli -hf IkariDev/Athena-v4-GGUF:
Use Docker
docker model run hf.co/IkariDev/Athena-v4-GGUF:
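
Once llama-server is running, it can be queried from a script. The sketch below is a minimal, unofficial example. It assumes the server listens on the default address 127.0.0.1:8080 and exposes the OpenAI-style /v1/completions route; the helper names (build_payload, complete) and the sampling values are illustrative choices, not settings recommended by the model author.

```python
import json
import urllib.request

# Assumption: llama-server was started with default host/port settings.
SERVER_URL = "http://127.0.0.1:8080/v1/completions"


def build_payload(prompt, max_tokens=256, temperature=0.7):
    """Build the JSON body for an OpenAI-style text completion request.

    The default values here are placeholders for illustration only.
    """
    return {
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
        # Stop if the model starts writing a new Alpaca instruction block.
        "stop": ["### Instruction:"],
    }


def complete(prompt, url=SERVER_URL):
    """POST the prompt to the server and return the generated text."""
    data = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["text"]


if __name__ == "__main__":
    # Requires a running llama-server; the prompt follows the Alpaca format.
    alpaca_prompt = (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n\n"
        "### Instruction:\nIntroduce yourself in one sentence.\n\n"
        "### Response:\n"
    )
    print(complete(alpaca_prompt))
```

The raw completion route is used here instead of the chat route so that the Alpaca-formatted prompt is sent to the model unchanged.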

Experimental Athena v4 model. Use Alpaca format. Suitable for RP, ERP and general stuff.

I should state here that this is a HIGHLY experimental model!

Description

This repo contains GGUF files of Athena-V4.

  • GGUF - by TheBloke
  • GPTQ - by TheBloke
  • exl2 - by waldie
  • AWQ - by TheBloke
  • fp16 - by IkariDev+Undi95
  • OLD (GGUF - by IkariDev+Undi95)

Ratings:

Note: I have permission from all users to upload their ratings. I DON'T screenshot random reviews without asking if I can put them here!

[Image: screenshots of user ratings]

If you want your rating to appear here, send me a message on Discord and I'll put up a screenshot of it. My Discord name is "ikaridev".

Models+loras used and recipe

  • Athena-v3
  • Xwin-LM/Xwin-LM-13B-V0.1
  • Undi95/PsyMedRP-v1-13B
  • cgato/Thespis-13b-v0.2
  • jondurbin/airoboros-l2-13b-3.0
Athena-v4-tmp1 = [ Athena-v3(0.85)+Xwin-LM/Xwin-LM-13B-V0.1(0.15) ]
Athena-v4-tmp2 = [ Undi95/PsyMedRP-v1-13B(0.55)+cgato/Thespis-13b-v0.2(0.45) ]

Athena-v4-tmp3 = Athena-v4-tmp1(0.55) + Athena-v4-tmp2(0.35)

Athena-v4 = Athena-v4-tmp3 + jondurbin/airoboros-l2-13b-3.0(0.1)
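
To see how much each source model ends up contributing, the recipe can be expanded arithmetically. The sketch below assumes the numbers in parentheses are plain linear blend weights and that Athena-v4-tmp3 is carried into the final step as written (its two parts already sum to 0.9, leaving 0.1 for airoboros). The card does not state the merge method, so this is bookkeeping only, not the actual merge script; the helper names blend and base are hypothetical.

```python
from collections import defaultdict


def base(name):
    """A source model counts as 100% of itself."""
    return {name: 1.0}


def blend(*parts):
    """Combine (mixture, weight) pairs into one mixture of source-model shares.

    Assumption: weights act as simple linear coefficients.
    """
    result = defaultdict(float)
    for mixture, weight in parts:
        for name, share in mixture.items():
            result[name] += share * weight
    return dict(result)


tmp1 = blend((base("Athena-v3"), 0.85), (base("Xwin-LM/Xwin-LM-13B-V0.1"), 0.15))
tmp2 = blend((base("Undi95/PsyMedRP-v1-13B"), 0.55), (base("cgato/Thespis-13b-v0.2"), 0.45))
tmp3 = blend((tmp1, 0.55), (tmp2, 0.35))
# tmp3 already carries a total weight of 0.9, so it is added unchanged.
athena_v4 = blend((tmp3, 1.0), (base("jondurbin/airoboros-l2-13b-3.0"), 0.1))

if __name__ == "__main__":
    for name, share in sorted(athena_v4.items(), key=lambda item: -item[1]):
        print(f"{name}: {share:.4f}")
    print(f"total: {sum(athena_v4.values()):.4f}")
```

Under these assumptions the shares work out to roughly 0.4675 for Athena-v3, 0.1925 for PsyMedRP, 0.1575 for Thespis, 0.1 for airoboros, and 0.0825 for Xwin-LM, which together sum to 1.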

Prompt template: Alpaca

Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{prompt}

### Response:

Thanks to Undi95 for providing the machine for Athena v2 and Athena v3, and for giving me info about how things work. Going forward, I will use a merging server provided by a friend.

Format: GGUF
Model size: 13B params
Architecture: llama
Quantizations: 3-bit, 4-bit, 5-bit, 8-bit