Introducing Palmyra-mini: Compact AI Models for Efficient Inference
The Palmyra-mini family from Writer includes three lightweight models designed for high performance and efficient inference. These models are ideal for developers looking to integrate AI capabilities without excessive computational overhead.
Model Variants
* palmyra-mini: A base model for general-purpose generative tasks, achieving 52.6% on Big Bench Hard (exact match).
* palmyra-mini-thinking-a: Optimized for complex logical reasoning with a Chain of Thought (CoT) approach, scoring 82.87% on GSM8K (strict match).
* palmyra-mini-thinking-b: Specialized for mathematical reasoning, achieving 92.5% on AMC23.
Technical Details
* All models are based on the Qwen architecture, compatible with popular inference frameworks like vLLM, SGLang, and TGI.
* "Thinking" models utilize CoT training for enhanced reasoning capabilities.
* GGUF and MLX quantizations are available for optimized performance.
Also check out a mobile implementation of palmyra-mini on iOS here to see a to see a working example of how inference can be incorporated on-device.(https://github.com/tsperes/palmyra-mini-mobile/)
Try out our Hacker News Listener with new built-in RAG capabilities and Palmyra X 004 from the team at Writer!
This Writer Framework app:
- Scrapes up to 500 HN stories and comments - Uploads them to a Knowledge Graph - Enables interactive chat with the content using graph-based RAG - Provides source attribution with every response
The best part? Setting up RAG is now incredibly simple - just a few lines of code to connect your Knowledge Graph as a tool with Palmyra X 004.
๐ฅ Today, Writer dropped Palmyra-Med-70b and Palmyra-Fin-70b, two new domain-specific models that are setting a new standard for medical and financial model performance.
TL;DR Palmyra-Med-70b ๐ข 8k and 32k versions available ๐ MMLU performance of ~86%, outperforming other top models ๐จโโ๏ธ Great for diagnosing, planning treatments, medical research, insurance coding and billing ๐ Open-model license for non-commercial use cases ๐ค Available on Hugging Face: Writer/Palmyra-Med-70B ๐พ Live on NVIDIA NIM: https://build.nvidia.com/writer/palmyra-med-70b
Palmyra-Fin-70b ๐ Passed the CFA Level III exam with a 73% score โ the first model to do so ๐ธ Skilled at complex tasks like investment research, financial analysis, and sentiment analysis ๐ Outperformed other top models on a long-fin-eval test of real-world use cases ๐ Open-model license for non-commercial use cases ๐ค Available on Hugging Face: Writer/Palmyra-Fin-70B-32K ๐พ Live on NVIDIA NIM: https://build.nvidia.com/writer/palmyra-fin-70b-32k