Skip to content

mosure/burn_dragon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

32 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

burn_dragon_hatchling πŸ”₯πŸ‰πŸ£

test GitHub License crates.io

burn inference and training of dragon hatchling model

Alt text Alt text

features

  • cached inference
  • training benchmarks and reporting
  • pope positional embeddings
  • wasm deployments
  • adaptive tool discovery
  • conditional (deep) gating
  • document-coherent dataloading and scale mixup
  • episodic memory
  • fused kernels
  • hierarchical, memory-aware recurrent state
  • mixture-of-expert routing
  • multi-modal architecture
  • multi-stream truncated backpropagation through time
  • neuromorphic backend
  • rl reasoning training
  • sparsity metrics and visualization
  • streaming, sparse synaptic backpropagation
  • temporal neuron dampening

Dataset configuration (built-in presets and Hugging Face examples) is documented inline in config/base.toml.

training

  • cargo run --release (defaults to the cuda backend)

inference

  • cargo run --bin infer -- --max-tokens 2048 --streaming

benchmarks

  • cargo bench (executes both wgpu and cuda benchmarks)
  • open target/criterion/report/index.html

compatible burn versions

burn_dragon_hatchling burn
0.2 0.19
0.1 0.18

license

licensed under either of

at your option.

contribution

unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Alt text

About

burn inference and training of dragon models πŸ”₯πŸ‰

Topics

Resources

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE-APACHE
MIT
LICENSE-MIT

Stars

Watchers

Forks

Releases

No releases published

Contributors