
Alan Dao's personal blog
AI Researcher
For work
- Email: [email protected]
- Github: https://github.com/tikikun
- Twitter: https://x.com/alandao_ai
About me
I'm a passionate AI practitioner who wants to build things!
Here are some of the things I have built, contributed to, and invented with my dear friends and colleagues.
Things I have built!
Ichigo
Ichigo is a multimodal AI model that handles speech natively, with lower latency than a traditional ASR-to-LLM pipeline. Find out more below.
AlphaMaze
AlphaMaze is a novel two-stage training framework that equips LLMs with visual reasoning abilities for maze navigation using Supervised Fine-Tuning (SFT) and Group Relative Policy Optimization (GRPO).
PoseLess
PoseLess is a novel framework for robot hand control that maps 2D images to joint angles without explicit pose estimation, enabling zero-shot generalization and cross-morphology transfer.
Jan
Jan is the most popular open-source AI chatbot client, with more than 1.5M downloads (and counting).
Nitro (now Cortex)
A super lightweight inference engine for LLMs.
- https://github.com/janhq/cortex (previously nitro - I invented this)
- https://github.com/janhq/nitro-tensorrt-llm
Publications
I also wrote some papers! (More coming soon.) Check out my Google Scholar here.
- Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
  Alan Dao, Dinh Bach Vu, Huy Hoang Ha
  arXiv:2410.15316
- AlphaMaze: Enhancing Large Language Models’ Spatial Intelligence via GRPO
  Alan Dao, Dinh Bach Vu
  arXiv:2502.14669
- PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM
  Alan Dao, Dinh Bach Vu, Tuan Le Duc Anh, Bui Quang Huy
  arXiv:2503.07111
I believe that in the next 10 years, everyone will have access to the most sophisticated and well-tailored education system that has ever existed, thanks to mass AI adoption.
The future is bright!