mirror of https://github.com/maderix/ANE.git
Revise README for clarity and additional details
Updated README to clarify implementation details and added NEON CPU decode.
This commit is contained in:
parent
40a5384074
commit
70bfa4e54e
|
|
@ -8,11 +8,9 @@ GPT2 had more soul in it's theoretical pinky finger than all of us combined.
|
||||||
|
|
||||||
But I digress..
|
But I digress..
|
||||||
|
|
||||||
Training neural networks directly on Apple's Neural Engine (ANE) via reverse-engineered private APIs. No CoreML training APIs, no Metal, no GPU — pure ANE compute.
|
|
||||||
|
|
||||||
## What This Is
|
## What This Is
|
||||||
|
|
||||||
A from-scratch implementation of transformer training (forward + backward pass) running on the ANE in Apple Silicon. The ANE is a 15.8 TFLOPS (M4) inference accelerator that Apple does not expose for training. This project reverse-engineers the `_ANEClient` / `_ANECompiler` private APIs and the MIL (Model Intermediate Language) format to run custom compute graphs — including backpropagation — directly on ANE hardware.
|
A from-scratch implementation of transformer training (forward + backward pass) running on the ANE in Apple Silicon with NEON cpu decode.
|
||||||
|
|
||||||
I forked diz shit and need to write out everything different so stay tuned.
|
I forked diz shit and need to write out everything different so stay tuned.
|
||||||
|
|
||||||
|
|
@ -23,3 +21,4 @@ This project is independent research into Apple Neural Engine architecture. It u
|
||||||
## License
|
## License
|
||||||
|
|
||||||
MIT — see [LICENSE](LICENSE)
|
MIT — see [LICENSE](LICENSE)
|
||||||
|
|
||||||
|
|
|
||||||
Loading…
Reference in New Issue