AbstractPhil commited on
Commit
4449dd1
Β·
verified Β·
1 Parent(s): 1102229

Create immediate_feed.txt

Browse files
Files changed (1) hide show
  1. immediate_feed.txt +383 -0
immediate_feed.txt ADDED
@@ -0,0 +1,383 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ============================================================
2
+ CantorLinear ImageNet CLIP Features Training
3
+ ============================================================
4
+
5
+ Configuration:
6
+ Dataset: AbstractPhil/imagenet-clip-features-orderly
7
+ CLIP dim: 512
8
+ Hidden dims: Direct
9
+ Cantor depth: 8
10
+ Batch size: 512
11
+ Learning rate: 0.001
12
+ Device: cuda
13
+
14
+ Loading dataset...
15
+ clip_vit_b16/train-00000-of-00006.parque(…): 100%
16
+  571M/571M [00:09<00:00, 192MB/s]
17
+ clip_vit_b16/train-00001-of-00006.parque(…): 100%
18
+  571M/571M [00:05<00:00, 144MB/s]
19
+ clip_vit_b16/train-00002-of-00006.parque(…): 100%
20
+  571M/571M [00:09<00:00, 59.9MB/s]
21
+ clip_vit_b16/train-00003-of-00006.parque(…): 100%
22
+  571M/571M [00:08<00:00, 116MB/s]
23
+ clip_vit_b16/train-00004-of-00006.parque(…): 100%
24
+  571M/571M [00:07<00:00, 194MB/s]
25
+ clip_vit_b16/train-00005-of-00006.parque(…): 100%
26
+  571M/571M [00:08<00:00, 127MB/s]
27
+ clip_vit_b16/validation-00000-of-00001.p(…): 100%
28
+  134M/134M [00:03<00:00, 98.9MB/s]
29
+ clip_vit_b16/test-00000-of-00001.parquet: 100%
30
+  267M/267M [00:05<00:00, 158MB/s]
31
+ Generating train split: 100%
32
+  1281167/1281167 [00:08<00:00, 150853.22 examples/s]
33
+ Generating validation split: 100%
34
+  50000/50000 [00:02<00:00, 18659.96 examples/s]
35
+ Generating test split: 100%
36
+  100000/100000 [00:00<00:00, 178346.18 examples/s]
37
+ Train samples: 1153050
38
+ Val samples: 128117
39
+
40
+ Building model...
41
+ Total parameters: 513,001
42
+ Trainable parameters: 513,001
43
+ CantorLinear layers: 1
44
+ Avg mask density: 0.0391
45
+
46
+ Starting training...
47
+ Epoch 1/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:59<00:00, 18.89it/s, loss=6.8454, acc=14.31%]
48
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.88it/s]
49
+
50
+ Epoch 1/50
51
+ Train Loss: 6.8453 | Train Acc: 14.32%
52
+ Val Loss: 6.7203 | Val Acc: 34.11%
53
+ Mean Alpha: 0.5588 | Mean Density: 0.0391
54
+ βœ“ New best model saved! (Val Acc: 34.11%)
55
+ Epoch 2/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.03it/s, loss=6.4762, acc=37.85%]
56
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.86it/s]
57
+
58
+ Epoch 2/50
59
+ Train Loss: 6.4759 | Train Acc: 37.85%
60
+ Val Loss: 6.1765 | Val Acc: 39.15%
61
+ Mean Alpha: 0.5812 | Mean Density: 0.0391
62
+ βœ“ New best model saved! (Val Acc: 39.15%)
63
+ Epoch 3/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.07it/s, loss=5.7757, acc=40.62%]
64
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.74it/s]
65
+
66
+ Epoch 3/50
67
+ Train Loss: 5.7754 | Train Acc: 40.62%
68
+ Val Loss: 5.3353 | Val Acc: 42.73%
69
+ Mean Alpha: 0.6131 | Mean Density: 0.0391
70
+ βœ“ New best model saved! (Val Acc: 42.73%)
71
+ Epoch 4/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.10it/s, loss=4.8301, acc=46.62%]
72
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.50it/s]
73
+
74
+ Epoch 4/50
75
+ Train Loss: 4.8296 | Train Acc: 46.62%
76
+ Val Loss: 4.3151 | Val Acc: 51.32%
77
+ Mean Alpha: 0.6513 | Mean Density: 0.0391
78
+ βœ“ New best model saved! (Val Acc: 51.32%)
79
+ Epoch 5/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.11it/s, loss=3.7933, acc=56.48%]
80
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 19.13it/s]
81
+
82
+ Epoch 5/50
83
+ Train Loss: 3.7930 | Train Acc: 56.48%
84
+ Val Loss: 3.2999 | Val Acc: 61.08%
85
+ Mean Alpha: 0.6914 | Mean Density: 0.0391
86
+ βœ“ New best model saved! (Val Acc: 61.08%)
87
+ Epoch 6/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.03it/s, loss=2.8774, acc=64.57%]
88
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.69it/s]
89
+
90
+ Epoch 6/50
91
+ Train Loss: 2.8771 | Train Acc: 64.57%
92
+ Val Loss: 2.5271 | Val Acc: 67.17%
93
+ Mean Alpha: 0.7286 | Mean Density: 0.0391
94
+ βœ“ New best model saved! (Val Acc: 67.17%)
95
+ Epoch 7/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.03it/s, loss=2.2531, acc=68.98%]
96
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.88it/s]
97
+
98
+ Epoch 7/50
99
+ Train Loss: 2.2527 | Train Acc: 68.98%
100
+ Val Loss: 2.0361 | Val Acc: 70.25%
101
+ Mean Alpha: 0.7614 | Mean Density: 0.0391
102
+ βœ“ New best model saved! (Val Acc: 70.25%)
103
+ Epoch 8/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.08it/s, loss=1.8587, acc=71.36%]
104
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.40it/s]
105
+
106
+ Epoch 8/50
107
+ Train Loss: 1.8588 | Train Acc: 71.36%
108
+ Val Loss: 1.7251 | Val Acc: 71.92%
109
+ Mean Alpha: 0.7910 | Mean Density: 0.0391
110
+ βœ“ New best model saved! (Val Acc: 71.92%)
111
+ Epoch 9/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.05it/s, loss=1.6052, acc=72.79%]
112
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.70it/s]
113
+
114
+ Epoch 9/50
115
+ Train Loss: 1.6050 | Train Acc: 72.79%
116
+ Val Loss: 1.5215 | Val Acc: 73.01%
117
+ Mean Alpha: 0.8171 | Mean Density: 0.0391
118
+ βœ“ New best model saved! (Val Acc: 73.01%)
119
+ Epoch 10/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.17it/s, loss=1.4355, acc=73.78%]
120
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.74it/s]
121
+
122
+ Epoch 10/50
123
+ Train Loss: 1.4355 | Train Acc: 73.78%
124
+ Val Loss: 1.3817 | Val Acc: 73.82%
125
+ Mean Alpha: 0.8400 | Mean Density: 0.0391
126
+ βœ“ New best model saved! (Val Acc: 73.82%)
127
+ Epoch 11/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.19it/s, loss=1.3162, acc=74.56%]
128
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.31it/s]
129
+
130
+ Epoch 11/50
131
+ Train Loss: 1.3162 | Train Acc: 74.56%
132
+ Val Loss: 1.2818 | Val Acc: 74.49%
133
+ Mean Alpha: 0.8598 | Mean Density: 0.0391
134
+ βœ“ New best model saved! (Val Acc: 74.49%)
135
+ Epoch 12/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.12it/s, loss=1.2289, acc=75.16%]
136
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.80it/s]
137
+
138
+ Epoch 12/50
139
+ Train Loss: 1.2287 | Train Acc: 75.16%
140
+ Val Loss: 1.2072 | Val Acc: 74.97%
141
+ Mean Alpha: 0.8766 | Mean Density: 0.0391
142
+ βœ“ New best model saved! (Val Acc: 74.97%)
143
+ Epoch 13/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.13it/s, loss=1.1623, acc=75.65%]
144
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.78it/s]
145
+
146
+ Epoch 13/50
147
+ Train Loss: 1.1622 | Train Acc: 75.65%
148
+ Val Loss: 1.1495 | Val Acc: 75.39%
149
+ Mean Alpha: 0.8909 | Mean Density: 0.0391
150
+ βœ“ New best model saved! (Val Acc: 75.39%)
151
+ Epoch 14/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.08it/s, loss=1.1100, acc=76.09%]
152
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 19.00it/s]
153
+
154
+ Epoch 14/50
155
+ Train Loss: 1.1100 | Train Acc: 76.09%
156
+ Val Loss: 1.1040 | Val Acc: 75.75%
157
+ Mean Alpha: 0.9027 | Mean Density: 0.0391
158
+ βœ“ New best model saved! (Val Acc: 75.75%)
159
+ Epoch 15/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.15it/s, loss=1.0681, acc=76.46%]
160
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.85it/s]
161
+
162
+ Epoch 15/50
163
+ Train Loss: 1.0681 | Train Acc: 76.46%
164
+ Val Loss: 1.0670 | Val Acc: 76.06%
165
+ Mean Alpha: 0.9128 | Mean Density: 0.0391
166
+ βœ“ New best model saved! (Val Acc: 76.06%)
167
+ Epoch 16/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.11it/s, loss=1.0336, acc=76.79%]
168
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.90it/s]
169
+
170
+ Epoch 16/50
171
+ Train Loss: 1.0337 | Train Acc: 76.79%
172
+ Val Loss: 1.0367 | Val Acc: 76.32%
173
+ Mean Alpha: 0.9212 | Mean Density: 0.0391
174
+ βœ“ New best model saved! (Val Acc: 76.32%)
175
+ Epoch 17/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.13it/s, loss=1.0049, acc=77.06%]
176
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.68it/s]
177
+
178
+ Epoch 17/50
179
+ Train Loss: 1.0048 | Train Acc: 77.07%
180
+ Val Loss: 1.0113 | Val Acc: 76.57%
181
+ Mean Alpha: 0.9284 | Mean Density: 0.0391
182
+ βœ“ New best model saved! (Val Acc: 76.57%)
183
+ Epoch 18/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.22it/s, loss=0.9806, acc=77.32%]
184
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.67it/s]
185
+
186
+ Epoch 18/50
187
+ Train Loss: 0.9809 | Train Acc: 77.32%
188
+ Val Loss: 0.9898 | Val Acc: 76.79%
189
+ Mean Alpha: 0.9343 | Mean Density: 0.0391
190
+ βœ“ New best model saved! (Val Acc: 76.79%)
191
+ Epoch 19/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.09it/s, loss=0.9598, acc=77.55%]
192
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.82it/s]
193
+
194
+ Epoch 19/50
195
+ Train Loss: 0.9598 | Train Acc: 77.55%
196
+ Val Loss: 0.9715 | Val Acc: 76.96%
197
+ Mean Alpha: 0.9396 | Mean Density: 0.0391
198
+ βœ“ New best model saved! (Val Acc: 76.96%)
199
+ Epoch 20/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.15it/s, loss=0.9419, acc=77.75%]
200
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.83it/s]
201
+
202
+ Epoch 20/50
203
+ Train Loss: 0.9419 | Train Acc: 77.75%
204
+ Val Loss: 0.9556 | Val Acc: 77.15%
205
+ Mean Alpha: 0.9440 | Mean Density: 0.0391
206
+ βœ“ New best model saved! (Val Acc: 77.15%)
207
+ Epoch 21/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.21it/s, loss=0.9264, acc=77.92%]
208
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.74it/s]
209
+
210
+ Epoch 21/50
211
+ Train Loss: 0.9265 | Train Acc: 77.92%
212
+ Val Loss: 0.9420 | Val Acc: 77.27%
213
+ Mean Alpha: 0.9477 | Mean Density: 0.0391
214
+ βœ“ New best model saved! (Val Acc: 77.27%)
215
+ Epoch 22/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:56<00:00, 19.26it/s, loss=0.9128, acc=78.09%]
216
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.94it/s]
217
+
218
+ Epoch 22/50
219
+ Train Loss: 0.9126 | Train Acc: 78.09%
220
+ Val Loss: 0.9300 | Val Acc: 77.41%
221
+ Mean Alpha: 0.9511 | Mean Density: 0.0391
222
+ βœ“ New best model saved! (Val Acc: 77.41%)
223
+ Epoch 23/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.23it/s, loss=0.9007, acc=78.24%]
224
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.52it/s]
225
+
226
+ Epoch 23/50
227
+ Train Loss: 0.9007 | Train Acc: 78.24%
228
+ Val Loss: 0.9195 | Val Acc: 77.54%
229
+ Mean Alpha: 0.9540 | Mean Density: 0.0391
230
+ βœ“ New best model saved! (Val Acc: 77.54%)
231
+ Epoch 24/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.17it/s, loss=0.8902, acc=78.37%]
232
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.95it/s]
233
+
234
+ Epoch 24/50
235
+ Train Loss: 0.8904 | Train Acc: 78.37%
236
+ Val Loss: 0.9102 | Val Acc: 77.63%
237
+ Mean Alpha: 0.9565 | Mean Density: 0.0391
238
+ βœ“ New best model saved! (Val Acc: 77.63%)
239
+ Epoch 25/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.03it/s, loss=0.8809, acc=78.49%]
240
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 19.09it/s]
241
+
242
+ Epoch 25/50
243
+ Train Loss: 0.8809 | Train Acc: 78.49%
244
+ Val Loss: 0.9020 | Val Acc: 77.70%
245
+ Mean Alpha: 0.9587 | Mean Density: 0.0391
246
+ βœ“ New best model saved! (Val Acc: 77.70%)
247
+ Epoch 26/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.18it/s, loss=0.8725, acc=78.60%]
248
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.63it/s]
249
+
250
+ Epoch 26/50
251
+ Train Loss: 0.8724 | Train Acc: 78.60%
252
+ Val Loss: 0.8949 | Val Acc: 77.78%
253
+ Mean Alpha: 0.9606 | Mean Density: 0.0391
254
+ βœ“ New best model saved! (Val Acc: 77.78%)
255
+ Epoch 27/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.13it/s, loss=0.8651, acc=78.71%]
256
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.61it/s]
257
+
258
+ Epoch 27/50
259
+ Train Loss: 0.8651 | Train Acc: 78.71%
260
+ Val Loss: 0.8885 | Val Acc: 77.87%
261
+ Mean Alpha: 0.9623 | Mean Density: 0.0391
262
+ βœ“ New best model saved! (Val Acc: 77.87%)
263
+ Epoch 28/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.20it/s, loss=0.8585, acc=78.79%]
264
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.69it/s]
265
+
266
+ Epoch 28/50
267
+ Train Loss: 0.8584 | Train Acc: 78.79%
268
+ Val Loss: 0.8827 | Val Acc: 77.92%
269
+ Mean Alpha: 0.9637 | Mean Density: 0.0391
270
+ βœ“ New best model saved! (Val Acc: 77.92%)
271
+ Epoch 29/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.20it/s, loss=0.8527, acc=78.87%]
272
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.76it/s]
273
+
274
+ Epoch 29/50
275
+ Train Loss: 0.8526 | Train Acc: 78.87%
276
+ Val Loss: 0.8778 | Val Acc: 77.97%
277
+ Mean Alpha: 0.9650 | Mean Density: 0.0391
278
+ βœ“ New best model saved! (Val Acc: 77.97%)
279
+ Epoch 30/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.13it/s, loss=0.8475, acc=78.95%]
280
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.94it/s]
281
+
282
+ Epoch 30/50
283
+ Train Loss: 0.8476 | Train Acc: 78.95%
284
+ Val Loss: 0.8733 | Val Acc: 78.03%
285
+ Mean Alpha: 0.9661 | Mean Density: 0.0391
286
+ βœ“ New best model saved! (Val Acc: 78.03%)
287
+ Epoch 31/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.13it/s, loss=0.8429, acc=79.02%]
288
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.93it/s]
289
+
290
+ Epoch 31/50
291
+ Train Loss: 0.8429 | Train Acc: 79.02%
292
+ Val Loss: 0.8694 | Val Acc: 78.08%
293
+ Mean Alpha: 0.9671 | Mean Density: 0.0391
294
+ βœ“ New best model saved! (Val Acc: 78.08%)
295
+ Epoch 32/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.11it/s, loss=0.8387, acc=79.06%]
296
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 19.07it/s]
297
+
298
+ Epoch 32/50
299
+ Train Loss: 0.8387 | Train Acc: 79.06%
300
+ Val Loss: 0.8660 | Val Acc: 78.11%
301
+ Mean Alpha: 0.9680 | Mean Density: 0.0391
302
+ βœ“ New best model saved! (Val Acc: 78.11%)
303
+ Epoch 33/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:58<00:00, 19.07it/s, loss=0.8351, acc=79.12%]
304
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.81it/s]
305
+
306
+ Epoch 33/50
307
+ Train Loss: 0.8351 | Train Acc: 79.12%
308
+ Val Loss: 0.8629 | Val Acc: 78.18%
309
+ Mean Alpha: 0.9687 | Mean Density: 0.0391
310
+ βœ“ New best model saved! (Val Acc: 78.18%)
311
+ Epoch 34/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.14it/s, loss=0.8319, acc=79.16%]
312
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.85it/s]
313
+
314
+ Epoch 34/50
315
+ Train Loss: 0.8320 | Train Acc: 79.16%
316
+ Val Loss: 0.8602 | Val Acc: 78.21%
317
+ Mean Alpha: 0.9694 | Mean Density: 0.0391
318
+ βœ“ New best model saved! (Val Acc: 78.21%)
319
+ Epoch 35/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.21it/s, loss=0.8290, acc=79.21%]
320
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.74it/s]
321
+
322
+ Epoch 35/50
323
+ Train Loss: 0.8289 | Train Acc: 79.21%
324
+ Val Loss: 0.8578 | Val Acc: 78.24%
325
+ Mean Alpha: 0.9699 | Mean Density: 0.0391
326
+ βœ“ New best model saved! (Val Acc: 78.24%)
327
+ Epoch 36/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.16it/s, loss=0.8265, acc=79.25%]
328
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.69it/s]
329
+
330
+ Epoch 36/50
331
+ Train Loss: 0.8265 | Train Acc: 79.25%
332
+ Val Loss: 0.8558 | Val Acc: 78.26%
333
+ Mean Alpha: 0.9704 | Mean Density: 0.0391
334
+ βœ“ New best model saved! (Val Acc: 78.26%)
335
+ Epoch 37/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.19it/s, loss=0.8243, acc=79.28%]
336
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.76it/s]
337
+
338
+ Epoch 37/50
339
+ Train Loss: 0.8243 | Train Acc: 79.28%
340
+ Val Loss: 0.8540 | Val Acc: 78.28%
341
+ Mean Alpha: 0.9709 | Mean Density: 0.0391
342
+ βœ“ New best model saved! (Val Acc: 78.28%)
343
+ Epoch 38/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.18it/s, loss=0.8224, acc=79.31%]
344
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.97it/s]
345
+
346
+ Epoch 38/50
347
+ Train Loss: 0.8224 | Train Acc: 79.31%
348
+ Val Loss: 0.8525 | Val Acc: 78.30%
349
+ Mean Alpha: 0.9712 | Mean Density: 0.0391
350
+ βœ“ New best model saved! (Val Acc: 78.30%)
351
+ Epoch 39/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.19it/s, loss=0.8208, acc=79.33%]
352
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.89it/s]
353
+
354
+ Epoch 39/50
355
+ Train Loss: 0.8207 | Train Acc: 79.33%
356
+ Val Loss: 0.8512 | Val Acc: 78.32%
357
+ Mean Alpha: 0.9715 | Mean Density: 0.0391
358
+ βœ“ New best model saved! (Val Acc: 78.32%)
359
+ Epoch 40/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.13it/s, loss=0.8194, acc=79.35%]
360
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.91it/s]
361
+
362
+ Epoch 40/50
363
+ Train Loss: 0.8194 | Train Acc: 79.35%
364
+ Val Loss: 0.8501 | Val Acc: 78.33%
365
+ Mean Alpha: 0.9718 | Mean Density: 0.0391
366
+ βœ“ New best model saved! (Val Acc: 78.33%)
367
+ Epoch 41/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.14it/s, loss=0.8184, acc=79.37%]
368
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.88it/s]
369
+
370
+ Epoch 41/50
371
+ Train Loss: 0.8184 | Train Acc: 79.37%
372
+ Val Loss: 0.8492 | Val Acc: 78.35%
373
+ Mean Alpha: 0.9720 | Mean Density: 0.0391
374
+ βœ“ New best model saved! (Val Acc: 78.35%)
375
+ Epoch 42/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2253/2253 [01:57<00:00, 19.22it/s, loss=0.8174, acc=79.39%]
376
+ Evaluating: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 251/251 [00:13<00:00, 18.50it/s]
377
+
378
+ Epoch 42/50
379
+ Train Loss: 0.8174 | Train Acc: 79.39%
380
+ Val Loss: 0.8485 | Val Acc: 78.36%
381
+ Mean Alpha: 0.9722 | Mean Density: 0.0391
382
+ βœ“ New best model saved! (Val Acc: 78.36%)
383
+ Epoch 43/50: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 2158/2253 [01:53<00:05, 17.16it/s, loss=0.8170, acc=79.38%]