Commit Graph

2 Commits

Author SHA1 Message Date
maderix 20cd236f61 Add INT8 W8A8 support: 1.88x ANE throughput via quantize/dequantize MIL ops
- ane_int8_bench.m: standalone FP16 vs INT8 W8A8 benchmark (35.1 vs 18.6 TOPS on M4)
- bridge: add int8 weight blob builders (ane_bridge_build_weight_blob_int8, quantized)
- bridge: fix weight dict nil → @{} (prevents silent compile failure)
- README: update with Qwen3-0.6B, GQA, GPU↔ANE pipeline, INT8 results, file structure
2026-03-09 19:47:01 -07:00
Vipul ebac5dd73f Python Bridge+Memory leak fix+More functions 2026-03-03 02:04:36 -05:00