alphaXiv committed on
Commit 3b85a56 · verified · 1 Parent(s): a34a1d5

Update README.md

  1. README.md +64 -3
README.md CHANGED
@@ -1,3 +1,64 @@
- ---
- license: apache-2.0
- ---
+ ---
+ language: en
+ license: mit
+ tags:
+ - trm
+ - recursive-reasoning
+ - sudoku
+ - pytorch
+ - huggingface
+ datasets:
+ - custom
+ metrics:
+ - accuracy
+ widget:
+ - text: "Sample sudoku puzzle here"
+ ---
+
+ # TRM Model for Sudoku Solving
+
+ ## Model Description
+
+ This is a Tiny Recursive Model (TRM) fine-tuned for solving Sudoku puzzles. The model uses recursive reasoning to fill in missing numbers in Sudoku grids.
+
+ - **Developed by:** alphaXiv
+ - **Model type:** TRM-MLP
+ - **Language(s) (NLP):** N/A (grid-based reasoning)
+ - **License:** MIT
+ - **Finetuned from model:** Custom TRM architecture
+
+ ## Intended Use
+
+ ### Primary Use
+
+ This model is designed to solve Sudoku puzzles by predicting the correct numbers for empty cells in standard 9x9 Sudoku grids.
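
To make the task concrete, here is a minimal sketch of how a predicted grid could be checked against a puzzle. It is not taken from the repository: the function names `parse_grid` and `is_valid_solution`, and the 81-character string encoding (with `.` or `0` for empty cells), are assumptions made for illustration only.

```python
def parse_grid(text):
    """Parse 81 cells from a string; '.' or '0' marks an empty cell (assumed encoding)."""
    cells = [0 if ch in "0." else int(ch) for ch in text if not ch.isspace()]
    if len(cells) != 81:
        raise ValueError(f"expected 81 cells, got {len(cells)}")
    return [cells[r * 9:(r + 1) * 9] for r in range(9)]


def is_valid_solution(puzzle, solution):
    """Return True if `solution` keeps every clue of `puzzle` and obeys the Sudoku rules."""
    digits = set(range(1, 10))
    # Every given clue must be preserved in the solution.
    for r in range(9):
        for c in range(9):
            if puzzle[r][c] != 0 and puzzle[r][c] != solution[r][c]:
                return False
    rows = [set(row) for row in solution]
    cols = [set(col) for col in zip(*solution)]
    boxes = [
        {solution[br + dr][bc + dc] for dr in range(3) for dc in range(3)}
        for br in range(0, 9, 3)
        for bc in range(0, 9, 3)
    ]
    # Each row, column, and 3x3 box must contain the digits 1-9 exactly once.
    return all(group == digits for group in rows + cols + boxes)


PUZZLE = """
53..7.... 6..195... .98....6.
8...6...3 4..8.3..1 7...2...6
.6....28. ...419..5 ....8..79
"""
SOLUTION = """
534678912 672195348 198342567
859761423 426853791 713924856
961537284 287419635 345286179
"""

print(is_valid_solution(parse_grid(PUZZLE), parse_grid(SOLUTION)))
```

A checker like this only verifies a completed grid; it says nothing about how the model produces its predictions.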
+
+ ### Out-of-Scope Use
+
+ Not intended for general NLP tasks, image processing, or other puzzle types.
+
+ ## Limitations and Bias
+
+ - Trained only on standard 9x9 Sudoku puzzles
+ - May not handle non-standard Sudoku variants
+ - Performance depends on puzzle difficulty
+
+ ## Training Data
+
+ The model was trained on a dataset of extremely difficult Sudoku puzzles. The dataset includes:
+ - Partially filled 9x9 grids
+ - Correct solutions
+ - Difficulty ratings
+
+ ## Evaluation Results
+
+ | Variant | Metric | Claimed | Achieved |
+ |---------|--------|---------|----------|
+ | TRM-MLP | Accuracy | 87.4% | 79.37% ± 0.12% |
+ | TRM-Attention | Accuracy | 74.7% | 73.66% ± 0.13% |
+
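+ Results are from an independent reproduction study: "Claimed" is the originally reported accuracy, and "Achieved" is the accuracy measured in the reproduction.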
+
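
The README does not say whether accuracy is counted per puzzle or per cell, nor how the ± values were derived. As a hedged illustration only, the sketch below assumes puzzle-level exact-match accuracy and pairs it with a binomial standard error, one common way to attach an uncertainty to a proportion. The function name `exact_match_accuracy` is hypothetical.

```python
import math


def exact_match_accuracy(predictions, targets):
    """Fraction of puzzles solved exactly, with a binomial standard error.

    Assumption: a puzzle counts as correct only if the whole predicted
    grid equals the reference solution.
    """
    if len(predictions) != len(targets) or not targets:
        raise ValueError("predictions and targets must be non-empty and equal in length")
    solved = sum(p == t for p, t in zip(predictions, targets))
    n = len(targets)
    accuracy = solved / n
    # Standard error of a proportion: sqrt(p * (1 - p) / n).
    std_err = math.sqrt(accuracy * (1 - accuracy) / n)
    return accuracy, std_err


# Toy usage with placeholder strings standing in for full 81-cell grids.
acc, err = exact_match_accuracy(["a", "b", "c", "d"], ["a", "b", "x", "d"])
print(f"{acc:.2%} ± {err:.2%}")
```

If the reported ± instead reflects variation across repeated runs or seeds, the mean and standard deviation over those runs would be the appropriate summary.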
+ ## Repository
+
+ https://github.com/alphaXiv/TinyRecursiveModels