Summary
Elevate aprender-shell from solid baseline to world-class AI shell completion.
Current State (v0.2.0)
- N-gram language model (trigram default)
- Sub-10ms latency, 2.7MB binary
- Local-only, privacy-preserving
- Basic augmentation and AutoML tuning
Gap Analysis
| Feature |
Current |
World Class |
| Model |
N-gram |
Transformer (CodeT5) |
| Understanding |
Pattern matching |
Semantic |
| Context |
Command prefix only |
Dir, git, env, history |
| Learning |
Per-session |
Cross-session, federated |
| NL support |
None |
"list large files" → find . -size +100M |
Roadmap to World Class
Phase 1: Context Awareness
Phase 2: Semantic Understanding
Phase 3: Transformer Integration
Phase 4: Natural Language
Phase 5: Cross-Session Learning
Technical Requirements
- Latency: <50ms (transformer), <10ms (fallback n-gram)
- Binary size: <20MB (with embedded model)
- Memory: <100MB runtime
- Privacy: Local-first, opt-in cloud features
- Platforms: Linux, macOS, Windows
Dependencies from aprender
From more-learning-specs.md:
- §3: Contrastive learning (command embeddings)
- §31: Code-specific ML (AST tokenizer)
- §32: Embedding & retrieval (semantic search)
- §33: Transfer learning (cross-user knowledge)
Success Metrics
| Metric |
Current |
Target |
| Top-1 accuracy |
~40% |
>70% |
| Top-5 accuracy |
~70% |
>90% |
| User satisfaction |
Baseline |
4.5/5 stars |
| Daily active usage |
N/A |
>1000 users |
References
Summary
Elevate aprender-shell from solid baseline to world-class AI shell completion.
Current State (v0.2.0)
Gap Analysis
find . -size +100MRoadmap to World Class
Phase 1: Context Awareness
Phase 2: Semantic Understanding
Phase 3: Transformer Integration
Phase 4: Natural Language
find . -size +100MPhase 5: Cross-Session Learning
Technical Requirements
Dependencies from aprender
From
more-learning-specs.md:Success Metrics
References