Add bash script demo and create results folder #804
Open
klei22 wants to merge 1 commit into ReaLLMASIC:master from
Conversation
The bash script recursively collects length-extrapolation data for all checkpoints under the out/ subdirectories. report/results is designated for experiment results, organized by 1) folders, 2) notes, and 3) timestamps.
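As a rough illustration of the two ideas in the description (recursive checkpoint discovery and a results folder carrying notes and a timestamp), here is a minimal sketch. It is not the script from this PR: the function names, the `notes.md` filename, and the `<timestamp>_<name>` folder naming are assumptions made for the example.

```shell
#!/usr/bin/env bash
# Sketch only: function names, notes.md, and the folder naming scheme are
# assumptions for illustration, not the layout used by this PR.
set -eu

# Print every ckpt.pt found at any depth under the given directory, sorted.
find_checkpoints() {
  find "$1" -type f -name 'ckpt.pt' | sort
}

# Create <results_root>/<timestamp>_<name>/ with a notes file, then print its path.
make_results_dir() {
  local results_root="$1" name="$2" stamp dir
  stamp="$(date +%Y%m%d_%H%M%S)"
  dir="${results_root}/${stamp}_${name}"
  mkdir -p "$dir"
  printf 'experiment: %s\ncreated: %s\n' "$name" "$stamp" > "${dir}/notes.md"
  printf '%s\n' "$dir"
}
```

For example, `find_checkpoints out` lists the checkpoints to evaluate, and `make_results_dir report/results length_extrapolation` creates a dated folder to hold the resulting report.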
Pull request overview
Adds a small demo entrypoint for running the checkpoint block-size evaluation sweep, and introduces a report/results output artifact (an HTML Plotly report) intended to hold experiment results.
Changes:
- Add demos/ckpt_block_size_eval_sweep.sh to run demos/ckpt_block_size_eval_plotly.py over a fixed set of block sizes.
- Add a generated Plotly HTML report under report/results/.
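The sweep over a fixed set of block sizes can be pictured with the sketch below. It is a hedged illustration, not the contents of demos/ckpt_block_size_eval_sweep.sh: the block-size list is assumed to match the x-axis values in the committed report (256 through 8192), and `dry_run_eval` is a hypothetical stand-in that only prints what would run, since the real command-line interface of demos/ckpt_block_size_eval_plotly.py is not shown in this PR page.

```shell
#!/usr/bin/env bash
# Sketch only: the block-size list and both function names are assumptions.
set -eu

# Assumed to match the x-axis values in the committed Plotly report.
BLOCK_SIZES="256 512 1024 2048 4096 8192"

# Run the given command once per block size, appending the block size
# as the final argument.
sweep_block_sizes() {
  local bs
  for bs in $BLOCK_SIZES; do  # unquoted on purpose: split the list on spaces
    "$@" "$bs"
  done
}

# Hypothetical stand-in for the real evaluation call; prints instead of evaluating.
dry_run_eval() {
  local ckpt="$1" block_size="$2"
  printf 'would evaluate %s at block_size=%s\n' "$ckpt" "$block_size"
}
```

Calling `sweep_block_sizes dry_run_eval out/some_run/ckpt.pt` prints one line per block size; swapping `dry_run_eval` for a function that wraps the actual evaluation script turns the dry run into a real sweep.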
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| demos/ckpt_block_size_eval_sweep.sh | New bash wrapper to run a block-size eval sweep. |
| report/results/ckpt_block_size_eval_report.html | Added Plotly HTML output placed under the new results folder. |
Comment on lines +1 to +5
| <html> | ||
| <head><meta charset="utf-8" /></head> | ||
| <body> | ||
| <div> <script type="text/javascript">window.PlotlyConfig = {MathJaxConfig: 'local'};</script> | ||
| <script charset="utf-8" src="https://cdn.plot.ly/plotly-2.35.2.min.js"></script> <div id="e1f46763-6265-41a2-94c9-307d3f6e982a" class="plotly-graph-div" style="height:700px; width:100%;"></div> <script type="text/javascript"> window.PLOTLYENV=window.PLOTLYENV || {}; if (document.getElementById("e1f46763-6265-41a2-94c9-307d3f6e982a")) { Plotly.newPlot( "e1f46763-6265-41a2-94c9-307d3f6e982a", [{"error_y":{"array":[1.1711693127442473,1.2069127603500855,1.2932338339482161,1.2237863263055453,0.9881420032906614,1.2062250240918022],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.183177702963352,3.5970645273923876,4.538949589252472,5.021439916014671,5.6687283709049225,5.872678869605064],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2533550217176561,1.243741344360804,1.1575535124460987,1.2664144481264124,1.1859766109841912,0.8056470644432926],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","showlegend":true,"text":["abs_p
os_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1470313941836356,3.593449905633926,4.290417836129666,4.649716576814652,5.108523582220077,5.530939794778824],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.205119346474599,1.1108689415518582,1.1532471731431664,1.082693300357644,0.9796893916134162,0.9190606361476842],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2100645226240156,3.142702328681946,3.04960131752491,2.974058710753918,3.244880418539047,3.2988163843154905],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.229232901595238,1.1836774691355163,1.0860294009143288,1.1580029797342473,1.0382611684145628,0.8666892101764104],"type":
"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1972145063579083,3.165516791462898,3.05749903178215,2.994943890810013,3.1444412103891373,3.4713070793151855],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.229839656262297,1.381939779105441,1.306691581088126,1.3120070090787317,1.217565777025645,1.062129441919167],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4
096.0,8192.0],"y":[3.269234666407108,3.6860388265550137,4.52492835855484,5.2834278526306155,5.719171240091324,6.185392696619034],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.3098159031233516,1.2357584442614673,1.0833220952535523,1.1702447429563287,1.1778743148552069,0.8692764088336282],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1737595307826996,3.4813756090402603,4.189733317375183,4.694935292482376,5.155692719101906,5.569482795715332],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2011286483920067,1.2572724513651081,1.0710342867655078,1.126144849197639,0.9976592253403431,0.9658472980295552],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic
_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1955964788794518,3.3175913892388342,3.96429031175375,4.332895148038864,4.76487464594841,5.094436115264893],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1753309122500057,1.207787372660477,1.0707376656757182,1.1693297401092382,1.0084881814430224,0.827162707276734],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1499257571101187,3.37757970058918,3.8944128074645996,4.306054557979107,4.751133957266807,5.255770515918732],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2351446269691828,1.2863066411344837,1.0825616302548733,1.0768889947402653,1.0988993149944872,1.144190806868529],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combin
ations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.380117079615593,3.6494074739217757,4.687532418727875,5.231362653255463,5.60921201133728,5.854626818180084],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.136501198737402,1.2851702069867539,1.2073315770042965,1.21578759375896,1.1462557878310293,0.9875681828173191],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.5379876222610473,3.50727876663208,4.281125857830047,4.699422374963761,5.191025604009628,5.56436134815216],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y
":{"array":[1.2342970227752232,1.175612988552541,1.091663654677277,1.0588887231800772,1.1218907704195156,0.917319961032722],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.258898095309734,3.0858215336203574,3.074840038537979,3.067000065565109,3.251787336349487,3.464396350264549],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.185525293808795,1.1466517484444896,1.182428225675898,1.0986837694178198,1.1314727852895607,0.9905764986760235],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_o
n-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.227121212244034,3.2403261350393295,2.972999147415161,2.9670218584537507,3.113043026506901,3.311047913789749],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1903134138817184,1.2493285431749088,1.19925665419669,1.196886117474374,1.1331356900133243,1.129198647158318],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.283502102613449,3.662534193813801,4.7424332177639,5.297421558737755,5.686977095365524,6.06106441116333],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2053442189513872,1.3511622499720803,1.1001123122330165,1.1733978421658695,1.0602135098465564,1.1179816900664703],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_o
ff-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.4600685362815855,3.5133343799710275,4.239967309892178,4.759841972827911,5.187277385950089,5.5142748484611515],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2143665465132572,1.2310661502721787,1.1949774434859886,1.1503374458008893,1.1148554610645267,0.9203632075138743],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1432563539743423,3.2863979100584984,3.750591731369495,4.3066768782138825,4.843042047858238,5.349640479803085],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1259984752955048,1.1945349481826402,1.1040730747862857,1.010597981285566,1.0208611855269478,1.1368784279804849],"type":"da
ta","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.256227258324623,3.3625606961250307,3.864621636390686,4.420349858283997,4.911534878253937,5.118133749961853],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1764360456063292,0.9478469608279,0.9084475515708119,1.0766866270310014,0.9168887531644343,0.833632070533849],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2219318765
40184,5.523249472618103,6.692046685218811,7.199555394172669,7.6187840890884395,7.835197998046875],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2394279917002198,1.1255587537245535,1.149191406884376,1.1035070666987088,1.0064056004385553,1.1622621318070323],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.222992011666298,5.301767000198364,6.672817112922669,7.256243708610534,7.6551225547790525,7.733409386634826],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2183268697417986,1.2300565512942794,1.2640032709624487,1.2271878463160477,0.9783458174200189,1.13605279804697],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_
pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1901979760527612,5.517080591678619,6.887702176094055,7.613463150024414,8.096415564537049,8.175094900131226],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1736881761216675,1.0378599901913137,1.208148098728911,1.0254582313944065,0.8832620786973406,0.9289085452633794],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.165625315666199,5.463376346588134,6.717776810646057,7.464271119117737,7.9781970710754395,8.055128838539124],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2241781795070361,1.0624087315676247,0.9937659525949298,0.9142798365743423,0.801380086347914,0.691710947734982],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long
-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2479677442908286,5.626948791027069,6.593770612716675,7.220544089317322,7.6877972087860105,7.867616525650025],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2024886860369606,1.1315174820264926,1.0952386245804775,1.0221838558333842,0.9395963717761265,0.7912909355195166],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1796452652215956,5.355035945415497,6.434367649078369,7.277318877220154,7.658710151672364,7.88391362953186],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2942155048014963,1.0139610685302505,0.8538653413129483,0.9934852786352357,0.7725118128122352,0.7520174744059858],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextr
a\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.313060844838619,5.4544565935134885,6.634725425720215,7.136056174278259,7.593039022445678,7.8161044626235965],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2504385461165501,1.16241937803423,1.043149524763973,1.095017044264367,1.0040932950384753,0.9313356581767336],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.249216635465622,5.340493307113648,6.552237380981445,7.220886090278626,7.477792156219483,7.739381796836853],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1884332649300362,1.170249543
5205096,1.1519425499935678,1.0887239824963162,0.9818759585546905,0.908490215126979],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-fire_only\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.3640550824403763,3.1673665215969087,3.034058128595352,3.155968694806099,3.408859871864319,3.582525681734085],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2223854928176026,1.2093481789889882,1.1912574473987574,1.102572508321873,0.9321391269438896,0.8503013323329739],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.267668638467789,3.0928504432439805,3.040422465980053,3.1298206661939623,3.4695431846380234,3.760889756441116],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2304045482692978,1.232504078019873,1.113856165411703
8,1.1607864087234565,1.0483014714440935,0.9770614196192113],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-rope_only\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1902689230442047,3.5211612728238104,3.8598251264095307,4.675920508503914,5.546201281309128,6.430574028491974],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2833731454746273,1.1885666978856544,1.1535707564820554,1.010826630357373,1.031659062951179,0.8982088429086471],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2691667317152024,3.360022331237793,3.784856849193573,4.178276120901108,4.820666652202606,5.1550086169242855],"type":"scatter","xaxis":"x","yaxis":"y"}], 
{"template":{"data":{"barpolar":[{"marker":{"line":{"color":"rgb(17,17,17)","width":0.5},"pattern":{"fillmode":"overlay","size":10,"solidity":0.2}},"type":"barpolar"}],"bar":[{"error_x":{"color":"#f2f5fa"},"error_y":{"color":"#f2f5fa"},"marker":{"line":{"color":"rgb(17,17,17)","width":0.5},"pattern":{"fillmode":"overlay","size":10,"solidity":0.2}},"type":"bar"}],"carpet":[{"aaxis":{"endlinecolor":"#A2B1C6","gridcolor":"#506784","linecolor":"#506784","minorgridcolor":"#506784","startlinecolor":"#A2B1C6"},"baxis":{"endlinecolor":"#A2B1C6","gridcolor":"#506784","linecolor":"#506784","minorgridcolor":"#506784","startlinecolor":"#A2B1C6"},"type":"carpet"}],"choropleth":[{"colorbar":{"outlinewidth":0,"ticks":""},"type":"choropleth"}],"contourcarpet":[{"colorbar":{"outlinewidth":0,"ticks":""},"type":"contourcarpet"}],"contour":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"contour"}],"heatmapgl":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"heatmapgl"}],"heatmap":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"heatmap"}],"histogram2dcontour":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorsca
le":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"histogram2dcontour"}],"histogram2d":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"histogram2d"}],"histogram":[{"marker":{"pattern":{"fillmode":"overlay","size":10,"solidity":0.2}},"type":"histogram"}],"mesh3d":[{"colorbar":{"outlinewidth":0,"ticks":""},"type":"mesh3d"}],"parcoords":[{"line":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"parcoords"}],"pie":[{"automargin":true,"type":"pie"}],"scatter3d":[{"line":{"colorbar":{"outlinewidth":0,"ticks":""}},"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatter3d"}],"scattercarpet":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scattercarpet"}],"scattergeo":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scattergeo"}],"scattergl":[{"marker":{"line":{"color":"#283442"}},"type":"scattergl"}],"scattermapbox":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scattermapbox"}],"scatterpolargl":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatterpolargl"}],"scatterpolar":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatterpolar"}],"scatter":[{"marker":{"line":{"color":"#283442"}},"type":"scatter"}],"scatterternary":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatterternary"}],"surface":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333
,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"surface"}],"table":[{"cells":{"fill":{"color":"#506784"},"line":{"color":"rgb(17,17,17)"}},"header":{"fill":{"color":"#2a3f5f"},"line":{"color":"rgb(17,17,17)"}},"type":"table"}]},"layout":{"annotationdefaults":{"arrowcolor":"#f2f5fa","arrowhead":0,"arrowwidth":1},"autotypenumbers":"strict","coloraxis":{"colorbar":{"outlinewidth":0,"ticks":""}},"colorscale":{"diverging":[[0,"#8e0152"],[0.1,"#c51b7d"],[0.2,"#de77ae"],[0.3,"#f1b6da"],[0.4,"#fde0ef"],[0.5,"#f7f7f7"],[0.6,"#e6f5d0"],[0.7,"#b8e186"],[0.8,"#7fbc41"],[0.9,"#4d9221"],[1,"#276419"]],"sequential":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"sequentialminus":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]]},"colorway":["#636efa","#EF553B","#00cc96","#ab63fa","#FFA15A","#19d3f3","#FF6692","#B6E880","#FF97FF","#FECB52"],"font":{"color":"#f2f5fa"},"geo":{"bgcolor":"rgb(17,17,17)","lakecolor":"rgb(17,17,17)","landcolor":"rgb(17,17,17)","showlakes":true,"showland":true,"subunitcolor":"#506784"},"hoverlabel":{"align":"left"},"hovermode":"closest","mapbox":{"style":"dark"},"paper_bgcolor":"rgb(17,17,17)","plot_bgcolor":"rgb(17,17,17)","polar":{"angularaxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""},"bgcolor":"rgb(17,17,17)","radialaxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""}},"scene":{"xaxis":{"backgroundcolor":"rgb(17,17,17)","gridco
lor":"#506784","gridwidth":2,"linecolor":"#506784","showbackground":true,"ticks":"","zerolinecolor":"#C8D4E3"},"yaxis":{"backgroundcolor":"rgb(17,17,17)","gridcolor":"#506784","gridwidth":2,"linecolor":"#506784","showbackground":true,"ticks":"","zerolinecolor":"#C8D4E3"},"zaxis":{"backgroundcolor":"rgb(17,17,17)","gridcolor":"#506784","gridwidth":2,"linecolor":"#506784","showbackground":true,"ticks":"","zerolinecolor":"#C8D4E3"}},"shapedefaults":{"line":{"color":"#f2f5fa"}},"sliderdefaults":{"bgcolor":"#C8D4E3","bordercolor":"rgb(17,17,17)","borderwidth":1,"tickwidth":0},"ternary":{"aaxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""},"baxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""},"bgcolor":"rgb(17,17,17)","caxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""}},"title":{"x":0.05},"updatemenudefaults":{"bgcolor":"#506784","borderwidth":0},"xaxis":{"automargin":true,"gridcolor":"#283442","linecolor":"#506784","ticks":"","title":{"standoff":15},"zerolinecolor":"#283442","zerolinewidth":2},"yaxis":{"automargin":true,"gridcolor":"#283442","linecolor":"#506784","ticks":"","title":{"standoff":15},"zerolinecolor":"#283442","zerolinewidth":2}}},"xaxis":{"anchor":"y","domain":[0.0,1.0],"title":{"text":"Block size"}},"yaxis":{"anchor":"x","domain":[0.0,1.0],"title":{"text":"Validation loss"}},"legend":{"title":{"text":"Checkpoint"}},"title":{"text":"Validation Loss vs Block Size (dataset=minipile)"},"height":700}, {"responsive": true} ) }; </script> </div> |
Comment on lines +4 to +6
| <div> <script type="text/javascript">window.PlotlyConfig = {MathJaxConfig: 'local'};</script> | ||
| <script charset="utf-8" src="https://cdn.plot.ly/plotly-2.35.2.min.js"></script> <div id="e1f46763-6265-41a2-94c9-307d3f6e982a" class="plotly-graph-div" style="height:700px; width:100%;"></div> <script type="text/javascript"> window.PLOTLYENV=window.PLOTLYENV || {}; if (document.getElementById("e1f46763-6265-41a2-94c9-307d3f6e982a")) { Plotly.newPlot( "e1f46763-6265-41a2-94c9-307d3f6e982a", [{"error_y":{"array":[1.1711693127442473,1.2069127603500855,1.2932338339482161,1.2237863263055453,0.9881420032906614,1.2062250240918022],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.183177702963352,3.5970645273923876,4.538949589252472,5.021439916014671,5.6687283709049225,5.872678869605064],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2533550217176561,1.243741344360804,1.1575535124460987,1.2664144481264124,1.1859766109841912,0.8056470644432926],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","showlegend":true,"text":["abs_p
os_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1470313941836356,3.593449905633926,4.290417836129666,4.649716576814652,5.108523582220077,5.530939794778824],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.205119346474599,1.1108689415518582,1.1532471731431664,1.082693300357644,0.9796893916134162,0.9190606361476842],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2100645226240156,3.142702328681946,3.04960131752491,2.974058710753918,3.244880418539047,3.2988163843154905],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.229232901595238,1.1836774691355163,1.0860294009143288,1.1580029797342473,1.0382611684145628,0.8666892101764104],"type":
"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-fire_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1972145063579083,3.165516791462898,3.05749903178215,2.994943890810013,3.1444412103891373,3.4713070793151855],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.229839656262297,1.381939779105441,1.306691581088126,1.3120070090787317,1.217565777025645,1.062129441919167],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4
096.0,8192.0],"y":[3.269234666407108,3.6860388265550137,4.52492835855484,5.2834278526306155,5.719171240091324,6.185392696619034],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.3098159031233516,1.2357584442614673,1.0833220952535523,1.1702447429563287,1.1778743148552069,0.8692764088336282],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1737595307826996,3.4813756090402603,4.189733317375183,4.694935292482376,5.155692719101906,5.569482795715332],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2011286483920067,1.2572724513651081,1.0710342867655078,1.126144849197639,0.9976592253403431,0.9658472980295552],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic
_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1955964788794518,3.3175913892388342,3.96429031175375,4.332895148038864,4.76487464594841,5.094436115264893],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1753309122500057,1.207787372660477,1.0707376656757182,1.1693297401092382,1.0084881814430224,0.827162707276734],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_235711-rotary_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1499257571101187,3.37757970058918,3.8944128074645996,4.306054557979107,4.751133957266807,5.255770515918732],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2351446269691828,1.2863066411344837,1.0825616302548733,1.0768889947402653,1.0988993149944872,1.144190806868529],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combin
ations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.380117079615593,3.6494074739217757,4.687532418727875,5.231362653255463,5.60921201133728,5.854626818180084],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.136501198737402,1.2851702069867539,1.2073315770042965,1.21578759375896,1.1462557878310293,0.9875681828173191],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.5379876222610473,3.50727876663208,4.281125857830047,4.699422374963761,5.191025604009628,5.56436134815216],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y
":{"array":[1.2342970227752232,1.175612988552541,1.091663654677277,1.0588887231800772,1.1218907704195156,0.917319961032722],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.258898095309734,3.0858215336203574,3.074840038537979,3.067000065565109,3.251787336349487,3.464396350264549],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.185525293808795,1.1466517484444896,1.182428225675898,1.0986837694178198,1.1314727852895607,0.9905764986760235],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_o
n-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-fire_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.227121212244034,3.2403261350393295,2.972999147415161,2.9670218584537507,3.113043026506901,3.311047913789749],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1903134138817184,1.2493285431749088,1.19925665419669,1.196886117474374,1.1331356900133243,1.129198647158318],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.283502102613449,3.662534193813801,4.7424332177639,5.297421558737755,5.686977095365524,6.06106441116333],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2053442189513872,1.3511622499720803,1.1001123122330165,1.1733978421658695,1.0602135098465564,1.1179816900664703],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_o
ff-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.4600685362815855,3.5133343799710275,4.239967309892178,4.759841972827911,5.187277385950089,5.5142748484611515],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2143665465132572,1.2310661502721787,1.1949774434859886,1.1503374458008893,1.1148554610645267,0.9203632075138743],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1432563539743423,3.2863979100584984,3.750591731369495,4.3066768782138825,4.843042047858238,5.349640479803085],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1259984752955048,1.1945349481826402,1.1040730747862857,1.010597981285566,1.0208611855269478,1.1368784279804849],"type":"da
ta","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_cyclic_23571113-rotary_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.256227258324623,3.3625606961250307,3.864621636390686,4.420349858283997,4.911534878253937,5.118133749961853],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1764360456063292,0.9478469608279,0.9084475515708119,1.0766866270310014,0.9168887531644343,0.833632070533849],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2219318765
40184,5.523249472618103,6.692046685218811,7.199555394172669,7.6187840890884395,7.835197998046875],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2394279917002198,1.1255587537245535,1.149191406884376,1.1035070666987088,1.0064056004385553,1.1622621318070323],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.222992011666298,5.301767000198364,6.672817112922669,7.256243708610534,7.6551225547790525,7.733409386634826],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2183268697417986,1.2300565512942794,1.2640032709624487,1.2271878463160477,0.9783458174200189,1.13605279804697],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_
pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1901979760527612,5.517080591678619,6.887702176094055,7.613463150024414,8.096415564537049,8.175094900131226],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1736881761216675,1.0378599901913137,1.208148098728911,1.0254582313944065,0.8832620786973406,0.9289085452633794],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-fire_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.165625315666199,5.463376346588134,6.717776810646057,7.464271119117737,7.9781970710754395,8.055128838539124],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2241781795070361,1.0624087315676247,0.9937659525949298,0.9142798365743423,0.801380086347914,0.691710947734982],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long
-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2479677442908286,5.626948791027069,6.593770612716675,7.220544089317322,7.6877972087860105,7.867616525650025],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2024886860369606,1.1315174820264926,1.0952386245804775,1.0221838558333842,0.9395963717761265,0.7912909355195166],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_off-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1796452652215956,5.355035945415497,6.434367649078369,7.277318877220154,7.658710151672364,7.88391362953186],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2942155048014963,1.0139610685302505,0.8538653413129483,0.9934852786352357,0.7725118128122352,0.7520174744059858],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextr
a\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-False\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.313060844838619,5.4544565935134885,6.634725425720215,7.136056174278259,7.593039022445678,7.8161044626235965],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2504385461165501,1.16241937803423,1.043149524763973,1.095017044264367,1.0040932950384753,0.9313356581767336],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt","abs_pos_combinations_long-abs_norm_on-abs_learned-rotary_on-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.249216635465622,5.340493307113648,6.552237380981445,7.220886090278626,7.477792156219483,7.739381796836853],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.1884332649300362,1.170249543
5205096,1.1519425499935678,1.0887239824963162,0.9818759585546905,0.908490215126979],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-fire_only\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt","abs_pos_combinations_long-fire_only\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.3640550824403763,3.1673665215969087,3.034058128595352,3.155968694806099,3.408859871864319,3.582525681734085],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2223854928176026,1.2093481789889882,1.1912574473987574,1.102572508321873,0.9321391269438896,0.8503013323329739],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-fire_only-hyperspherenorm-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.267668638467789,3.0928504432439805,3.040422465980053,3.1298206661939623,3.4695431846380234,3.760889756441116],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2304045482692978,1.232504078019873,1.113856165411703
8,1.1607864087234565,1.0483014714440935,0.9770614196192113],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-rope_only\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt","abs_pos_combinations_long-rope_only\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.1902689230442047,3.5211612728238104,3.8598251264095307,4.675920508503914,5.546201281309128,6.430574028491974],"type":"scatter","xaxis":"x","yaxis":"y"},{"error_y":{"array":[1.2833731454746273,1.1885666978856544,1.1535707564820554,1.010826630357373,1.031659062951179,0.8982088429086471],"type":"data","visible":true},"hovertemplate":"ckpt=%{text}\u003cbr\u003eblock_size=%{x}\u003cbr\u003eval_loss=%{y:.6f}\u003cextra\u003e\u003c\u002fextra\u003e","mode":"lines+markers","name":"abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","showlegend":true,"text":["abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt","abs_pos_combinations_long-rope_only-hyperspherenorm-True\u002fckpt.pt"],"x":[256.0,512.0,1024.0,2048.0,4096.0,8192.0],"y":[3.2691667317152024,3.360022331237793,3.784856849193573,4.178276120901108,4.820666652202606,5.1550086169242855],"type":"scatter","xaxis":"x","yaxis":"y"}], 
{"template":{"data":{"barpolar":[{"marker":{"line":{"color":"rgb(17,17,17)","width":0.5},"pattern":{"fillmode":"overlay","size":10,"solidity":0.2}},"type":"barpolar"}],"bar":[{"error_x":{"color":"#f2f5fa"},"error_y":{"color":"#f2f5fa"},"marker":{"line":{"color":"rgb(17,17,17)","width":0.5},"pattern":{"fillmode":"overlay","size":10,"solidity":0.2}},"type":"bar"}],"carpet":[{"aaxis":{"endlinecolor":"#A2B1C6","gridcolor":"#506784","linecolor":"#506784","minorgridcolor":"#506784","startlinecolor":"#A2B1C6"},"baxis":{"endlinecolor":"#A2B1C6","gridcolor":"#506784","linecolor":"#506784","minorgridcolor":"#506784","startlinecolor":"#A2B1C6"},"type":"carpet"}],"choropleth":[{"colorbar":{"outlinewidth":0,"ticks":""},"type":"choropleth"}],"contourcarpet":[{"colorbar":{"outlinewidth":0,"ticks":""},"type":"contourcarpet"}],"contour":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"contour"}],"heatmapgl":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"heatmapgl"}],"heatmap":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"heatmap"}],"histogram2dcontour":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorsca
le":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"histogram2dcontour"}],"histogram2d":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"histogram2d"}],"histogram":[{"marker":{"pattern":{"fillmode":"overlay","size":10,"solidity":0.2}},"type":"histogram"}],"mesh3d":[{"colorbar":{"outlinewidth":0,"ticks":""},"type":"mesh3d"}],"parcoords":[{"line":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"parcoords"}],"pie":[{"automargin":true,"type":"pie"}],"scatter3d":[{"line":{"colorbar":{"outlinewidth":0,"ticks":""}},"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatter3d"}],"scattercarpet":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scattercarpet"}],"scattergeo":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scattergeo"}],"scattergl":[{"marker":{"line":{"color":"#283442"}},"type":"scattergl"}],"scattermapbox":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scattermapbox"}],"scatterpolargl":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatterpolargl"}],"scatterpolar":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatterpolar"}],"scatter":[{"marker":{"line":{"color":"#283442"}},"type":"scatter"}],"scatterternary":[{"marker":{"colorbar":{"outlinewidth":0,"ticks":""}},"type":"scatterternary"}],"surface":[{"colorbar":{"outlinewidth":0,"ticks":""},"colorscale":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333
,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"type":"surface"}],"table":[{"cells":{"fill":{"color":"#506784"},"line":{"color":"rgb(17,17,17)"}},"header":{"fill":{"color":"#2a3f5f"},"line":{"color":"rgb(17,17,17)"}},"type":"table"}]},"layout":{"annotationdefaults":{"arrowcolor":"#f2f5fa","arrowhead":0,"arrowwidth":1},"autotypenumbers":"strict","coloraxis":{"colorbar":{"outlinewidth":0,"ticks":""}},"colorscale":{"diverging":[[0,"#8e0152"],[0.1,"#c51b7d"],[0.2,"#de77ae"],[0.3,"#f1b6da"],[0.4,"#fde0ef"],[0.5,"#f7f7f7"],[0.6,"#e6f5d0"],[0.7,"#b8e186"],[0.8,"#7fbc41"],[0.9,"#4d9221"],[1,"#276419"]],"sequential":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]],"sequentialminus":[[0.0,"#0d0887"],[0.1111111111111111,"#46039f"],[0.2222222222222222,"#7201a8"],[0.3333333333333333,"#9c179e"],[0.4444444444444444,"#bd3786"],[0.5555555555555556,"#d8576b"],[0.6666666666666666,"#ed7953"],[0.7777777777777778,"#fb9f3a"],[0.8888888888888888,"#fdca26"],[1.0,"#f0f921"]]},"colorway":["#636efa","#EF553B","#00cc96","#ab63fa","#FFA15A","#19d3f3","#FF6692","#B6E880","#FF97FF","#FECB52"],"font":{"color":"#f2f5fa"},"geo":{"bgcolor":"rgb(17,17,17)","lakecolor":"rgb(17,17,17)","landcolor":"rgb(17,17,17)","showlakes":true,"showland":true,"subunitcolor":"#506784"},"hoverlabel":{"align":"left"},"hovermode":"closest","mapbox":{"style":"dark"},"paper_bgcolor":"rgb(17,17,17)","plot_bgcolor":"rgb(17,17,17)","polar":{"angularaxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""},"bgcolor":"rgb(17,17,17)","radialaxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""}},"scene":{"xaxis":{"backgroundcolor":"rgb(17,17,17)","gridco
lor":"#506784","gridwidth":2,"linecolor":"#506784","showbackground":true,"ticks":"","zerolinecolor":"#C8D4E3"},"yaxis":{"backgroundcolor":"rgb(17,17,17)","gridcolor":"#506784","gridwidth":2,"linecolor":"#506784","showbackground":true,"ticks":"","zerolinecolor":"#C8D4E3"},"zaxis":{"backgroundcolor":"rgb(17,17,17)","gridcolor":"#506784","gridwidth":2,"linecolor":"#506784","showbackground":true,"ticks":"","zerolinecolor":"#C8D4E3"}},"shapedefaults":{"line":{"color":"#f2f5fa"}},"sliderdefaults":{"bgcolor":"#C8D4E3","bordercolor":"rgb(17,17,17)","borderwidth":1,"tickwidth":0},"ternary":{"aaxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""},"baxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""},"bgcolor":"rgb(17,17,17)","caxis":{"gridcolor":"#506784","linecolor":"#506784","ticks":""}},"title":{"x":0.05},"updatemenudefaults":{"bgcolor":"#506784","borderwidth":0},"xaxis":{"automargin":true,"gridcolor":"#283442","linecolor":"#506784","ticks":"","title":{"standoff":15},"zerolinecolor":"#283442","zerolinewidth":2},"yaxis":{"automargin":true,"gridcolor":"#283442","linecolor":"#506784","ticks":"","title":{"standoff":15},"zerolinecolor":"#283442","zerolinewidth":2}}},"xaxis":{"anchor":"y","domain":[0.0,1.0],"title":{"text":"Block size"}},"yaxis":{"anchor":"x","domain":[0.0,1.0],"title":{"text":"Validation loss"}},"legend":{"title":{"text":"Checkpoint"}},"title":{"text":"Validation Loss vs Block Size (dataset=minipile)"},"height":700}, {"responsive": true} ) }; </script> </div> | ||
| </body> |
| @@ -0,0 +1,7 @@ | |||
| #!/bin/bash | |||
Comment on lines +3 to +7
| python3 demos/ckpt_block_size_eval_plotly.py \ | ||
| --block_sizes 256 512 1024 2048 4096 8192 \ | ||
| --dtype bfloat16 \ | ||
| --dark_mode \ | ||
| out |
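A parameterized variant of this wrapper could look like the sketch below. It is only an illustration: `CKPT_ROOT`, `BLOCK_SIZES`, `DRY_RUN`, and `run_sweep` are hypothetical names that do not appear in the PR, while the script path and flags are copied unchanged from the diff above.

```shell
#!/bin/bash
# Sketch only: CKPT_ROOT, BLOCK_SIZES, DRY_RUN and run_sweep are illustrative
# names, not part of the PR. The flags are the ones shown in the diff.
set -eu

CKPT_ROOT="${CKPT_ROOT:-out}"
BLOCK_SIZES="${BLOCK_SIZES:-256 512 1024 2048 4096 8192}"

run_sweep() {
  # When DRY_RUN is set and non-empty, the command is printed, not executed.
  # Word-splitting of $BLOCK_SIZES is intentional: one argument per size.
  # shellcheck disable=SC2086
  ${DRY_RUN:+echo} python3 demos/ckpt_block_size_eval_plotly.py \
    --block_sizes $BLOCK_SIZES \
    --dtype bfloat16 \
    --dark_mode \
    "$CKPT_ROOT"
}
```

Calling `DRY_RUN=1 run_sweep` prints the command line for inspection; calling `run_sweep` without `DRY_RUN` executes it from the repository root.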
The Bash script recursively collects length-extrapolation data for every checkpoint found in subdirectories of out.
report/results is designated for experiment results, organized by 1) folders, 2) notes, and 3) timestamps.
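One possible way to lay out such a results folder is sketched below. The PR does not specify a layout, so the naming scheme (`<timestamp>_<label>`), the `notes.md` file, and the names `new_results_dir` and `RESULTS_ROOT` are all assumptions made for illustration.

```shell
#!/bin/bash
# Sketch only: the directory naming, notes.md, new_results_dir and
# RESULTS_ROOT are assumptions, not something the PR defines.
set -eu

# Create <root>/<timestamp>_<label>/ containing a notes file, then print
# the new directory's path. The body runs in a subshell (parentheses), so
# its variables do not leak into the caller.
new_results_dir() (
  label="${1:-run}"
  root="${RESULTS_ROOT:-report/results}"
  stamp="$(date +%Y%m%d_%H%M%S)"
  dir="$root/${stamp}_${label}"
  mkdir -p "$dir"
  printf 'label: %s\ncreated: %s\n' "$label" "$stamp" > "$dir/notes.md"
  echo "$dir"
)
```

A sweep script could then capture the path with `dir="$(new_results_dir block_size_sweep)"` and move its generated report into that folder.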