Nicholas Stranges

nstranges

strangeman99

AI & ML interests

Reinforcement learning, robotics, LLM agents.

Recent Activity

liked a dataset 28 days ago

open-r1/DAPO-Math-17k-Processed

updated a model about 1 month ago

nstranges/smollm2-finetuned-chat-instruct-lora-adapters

published a model about 1 month ago

nstranges/smollm2-finetuned-chat-instruct-lora-adapters

Organizations

None yet

liked a dataset 28 days ago

open-r1/DAPO-Math-17k-Processed

Viewer • Updated Nov 10, 2025 • 34.8k • 5.16k • 53

updated a model about 1 month ago

nstranges/smollm2-finetuned-chat-instruct-lora-adapters

Updated Nov 22, 2025

published a model about 1 month ago

nstranges/smollm2-finetuned-chat-instruct-lora-adapters

Updated Nov 22, 2025

updated a model about 1 month ago

nstranges/CSC2516-HW10-Original-Model

0.1B • Updated Nov 21, 2025 • 3

published a model about 1 month ago

nstranges/CSC2516-HW10-Original-Model

0.1B • Updated Nov 21, 2025 • 3

liked a dataset about 1 month ago

trl-lib/tldr

Viewer • Updated Jan 8, 2025 • 130k • 3.63k • 30

liked a model about 1 month ago

meta-llama/Llama-3.2-1B

Text Generation • 1B • Updated Oct 24, 2024 • 1.72M • 2.24k

liked a dataset about 2 months ago

HuggingFaceH4/aime_2024

Viewer • Updated Jan 26, 2025 • 30 • 36.8k • 58

liked a model 3 months ago

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6, 2025 • 466k • 505

liked 2 datasets 3 months ago

allenai/RLVR-MATH

Viewer • Updated Nov 20, 2024 • 7.5k • 119 • 18

osunlp/Mind2Web

Viewer • Updated Oct 19, 2025 • 253 • 1.73k • 114

updated a model 3 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-Random-V2

8B • Updated Sep 21, 2025 • 4

published a model 3 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-Random-V2

8B • Updated Sep 21, 2025 • 4

updated a model 4 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2

8B • Updated Sep 12, 2025 • 5

published a model 4 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2

8B • Updated Sep 12, 2025 • 5

updated a model 4 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel

8B • Updated Aug 26, 2025 • 4

published a model 4 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel

8B • Updated Aug 26, 2025 • 4

updated a model 4 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-V2

8B • Updated Aug 25, 2025 • 6

published a model 4 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-V2

8B • Updated Aug 25, 2025 • 6

updated a model 4 months ago

nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta0.0-V2

8B • Updated Aug 25, 2025 • 4
