AI & ML interests

AGI, LLMs, Knowledge Graph, Palmyra, Domain Specific LLM

Recent Activity

Articles

tperesย 
posted an update 3 months ago
view post
Post
223
Introducing Palmyra-mini: Compact AI Models for Efficient Inference

The Palmyra-mini family from Writer includes three lightweight models designed for high performance and efficient inference. These models are ideal for developers looking to integrate AI capabilities without excessive computational overhead.

Model Variants

* palmyra-mini: A base model for general-purpose generative tasks, achieving 52.6% on Big Bench Hard (exact match).

* palmyra-mini-thinking-a: Optimized for complex logical reasoning with a Chain of Thought (CoT) approach, scoring 82.87% on GSM8K (strict match).

* palmyra-mini-thinking-b: Specialized for mathematical reasoning, achieving 92.5% on AMC23.

Technical Details

* All models are based on the Qwen architecture, compatible with popular inference frameworks like vLLM, SGLang, and TGI.

* "Thinking" models utilize CoT training for enhanced reasoning capabilities.

* GGUF and MLX quantizations are available for optimized performance.

For more information, including benchmark methodologies and detailed performance metrics, refer to our blog post: (https://huggingface.co/blog/Writer/announcing-palmyra-mini).

Model repos can be found here:
* Writer/palmyra-mini
* Writer/palmyra-mini-thinking-a
* Writer/palmyra-mini-thinking-b

Also check out a mobile implementation of palmyra-mini on iOS here to see a to see a working example of how inference can be incorporated on-device.(https://github.com/tsperes/palmyra-mini-mobile/)
samjulienย 
posted an update about 1 year ago
view post
Post
1575
๐Ÿ”ฅ RAG in just a few lines of code?!

Try out our Hacker News Listener with new built-in RAG capabilities and Palmyra X 004 from the team at Writer!

This Writer Framework app:

- Scrapes up to 500 HN stories and comments
- Uploads them to a Knowledge Graph
- Enables interactive chat with the content using graph-based RAG
- Provides source attribution with every response

The best part? Setting up RAG is now incredibly simple - just a few lines of code to connect your Knowledge Graph as a tool with Palmyra X 004.

๐Ÿค— Space: samjulien/hacker-news-listener
๐Ÿ’ป Code: https://github.com/writer/framework-tutorials/tree/main/hacker-news-social-listener
samjulienย 
posted an update over 1 year ago
view post
Post
1996
๐Ÿ”ฅ Today, Writer dropped Palmyra-Med-70b and Palmyra-Fin-70b, two new domain-specific models that are setting a new standard for medical and financial model performance.

TL;DR
Palmyra-Med-70b
๐Ÿ”ข 8k and 32k versions available
๐Ÿš€ MMLU performance of ~86%, outperforming other top models
๐Ÿ‘จโ€โš•๏ธ Great for diagnosing, planning treatments, medical research, insurance coding and billing
๐Ÿ“ƒ Open-model license for non-commercial use cases
๐Ÿค— Available on Hugging Face: Writer/Palmyra-Med-70B
๐Ÿ’พ Live on NVIDIA NIM: https://build.nvidia.com/writer/palmyra-med-70b

Palmyra-Fin-70b
๐Ÿš€ Passed the CFA Level III exam with a 73% score โ€” the first model to do so
๐Ÿ’ธ Skilled at complex tasks like investment research, financial analysis, and sentiment analysis
๐Ÿ“ˆ Outperformed other top models on a long-fin-eval test of real-world use cases
๐Ÿ“ƒ Open-model license for non-commercial use cases
๐Ÿค— Available on Hugging Face: Writer/Palmyra-Fin-70B-32K
๐Ÿ’พ Live on NVIDIA NIM: https://build.nvidia.com/writer/palmyra-fin-70b-32k

Try them out and let us know what you think!
  • 2 replies
ยท