Non-record: Mamba-Inspired SSM Hybrid 3:1 (val_bpb 3.3168) #1197

Open
dentity007 wants to merge 3 commits into openai:main from NathanMaine:research/mamba-hybrid
Conversation

@dentity007
Pure PyTorch SSM implementation (no custom CUDA kernels). Uses a 3:1 SSM:Attention layer ratio, following the Qwen3-Next/Kimi Linear pattern. The SSM blocks use selective gating, a causal Conv1d, and a SiLU output. Implements the 'State-space models' direction requested by OpenAI.
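To make the description concrete, here is a minimal sketch of the ideas it names, written in plain Python on lists of floats so it runs without PyTorch. It is not the code in this PR. The names `hybrid_schedule`, `causal_conv1d`, `selective_scan`, and `ssm_block` are hypothetical, as are the placement of attention as every fourth layer, the scalar gate parameters `w_gate` and `b_gate`, and the single-channel simplification.

```python
import math


def hybrid_schedule(n_layers, ssm_per_attn=3):
    """Layer kinds for an SSM:attention hybrid.

    Assumption: attention is placed as the last layer of each group of
    (ssm_per_attn + 1) layers; the PR may interleave differently.
    """
    period = ssm_per_attn + 1
    return ["attn" if (i + 1) % period == 0 else "ssm" for i in range(n_layers)]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def silu(z):
    # SiLU(z) = z * sigmoid(z)
    return z * sigmoid(z)


def causal_conv1d(x, kernel):
    """Single-channel causal convolution.

    Left-pads with zeros so y[t] depends only on x[t-k+1..t];
    kernel[-1] weights the current input.
    """
    k = len(kernel)
    padded = [0.0] * (k - 1) + list(x)
    return [sum(kernel[j] * padded[t + j] for j in range(k)) for t in range(len(x))]


def selective_scan(u, w_gate=1.0, b_gate=0.0):
    """Input-dependent recurrence: h[t] = (1 - g[t]) * h[t-1] + g[t] * u[t].

    The gate g[t] is computed from the current input, which is the
    'selective' part. A simplified scalar stand-in for a real SSM update.
    """
    h, out = 0.0, []
    for u_t in u:
        g_t = sigmoid(w_gate * u_t + b_gate)  # how much of u_t to write
        h = (1.0 - g_t) * h + g_t * u_t
        out.append(h)
    return out


def ssm_block(x, kernel, w_gate=1.0, b_gate=0.0):
    """Causal conv -> selective scan -> SiLU on the output."""
    u = causal_conv1d(x, kernel)
    h = selective_scan(u, w_gate, b_gate)
    return [silu(v) for v in h]
```

With these definitions, `hybrid_schedule(8)` yields three `"ssm"` entries followed by one `"attn"`, twice. Because both the convolution and the scan only look backward, changing a later input leaves earlier outputs of `ssm_block` unchanged, which is the property a causal language-model layer needs.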

🤖 Generated with Claude Code

@dentity007 dentity007 closed this Apr 1, 2026
@dentity007 dentity007 reopened this Apr 1, 2026