Nicholas Stranges
nstranges
0 followers
·
1 following
strangeman99
AI & ML interests
Reinforcement learning, robotics, LLM agents.
Organizations
None yet
nstranges's activity
liked
a dataset
28 days ago
open-r1/DAPO-Math-17k-Processed
Viewer
•
Updated
Nov 10, 2025
•
34.8k
•
5.16k
•
53
updated
a model
about 1 month ago
nstranges/smollm2-finetuned-chat-instruct-lora-adapters
Updated
Nov 22, 2025
published
a model
about 1 month ago
nstranges/smollm2-finetuned-chat-instruct-lora-adapters
Updated
Nov 22, 2025
updated
a model
about 1 month ago
nstranges/CSC2516-HW10-Original-Model
0.1B
•
Updated
Nov 21, 2025
•
3
published
a model
about 1 month ago
nstranges/CSC2516-HW10-Original-Model
0.1B
•
Updated
Nov 21, 2025
•
3
liked
a dataset
about 1 month ago
trl-lib/tldr
Viewer
•
Updated
Jan 8, 2025
•
130k
•
3.63k
•
30
liked
a model
about 1 month ago
meta-llama/Llama-3.2-1B
Text Generation
•
1B
•
Updated
Oct 24, 2024
•
1.72M
•
2.24k
liked
a dataset
about 2 months ago
HuggingFaceH4/aime_2024
Viewer
•
Updated
Jan 26, 2025
•
30
•
36.8k
•
58
liked
a model
3 months ago
Qwen/Qwen3-4B-Thinking-2507
Text Generation
•
4B
•
Updated
Aug 6, 2025
•
466k
•
505
liked
2 datasets
3 months ago
allenai/RLVR-MATH
Viewer
•
Updated
Nov 20, 2024
•
7.5k
•
119
•
18
osunlp/Mind2Web
Viewer
•
Updated
Oct 19, 2025
•
253
•
1.73k
•
114
updated
a model
3 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-Random-V2
8B
•
Updated
Sep 21, 2025
•
4
published
a model
3 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-Random-V2
8B
•
Updated
Sep 21, 2025
•
4
updated
a model
4 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2
8B
•
Updated
Sep 12, 2025
•
5
published
a model
4 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2
8B
•
Updated
Sep 12, 2025
•
5
updated
a model
4 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel
8B
•
Updated
Aug 26, 2025
•
4
published
a model
4 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel
8B
•
Updated
Aug 26, 2025
•
4
updated
a model
4 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-V2
8B
•
Updated
Aug 25, 2025
•
6
published
a model
4 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-V2
8B
•
Updated
Aug 25, 2025
•
6
updated
a model
4 months ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta0.0-V2
8B
•
Updated
Aug 25, 2025
•
4