ihb2032/README.md

Hi there 👋

🎓 About Me

I'm a university student passionate about RISC-V architecture and deep learning systems.

During my internship at ISCAS (July to December 2025), I contributed to optimizing neural network inference frameworks for RISC-V platforms.


📊 GitHub Stats

[Images: GitHub Stats card, Top Languages card]


💼 Internship Contributions

🧠 MNN - RISC-V Vector Extension Optimization

Implemented 50+ optimized kernels using RISC-V Vector instructions:

  • Data Packing: MNNPackC2/C4, MNNUnpackC4, MNNTranspose32Bit/16Bit (#4021, #4023)
  • Matrix Ops: MNNMatrixProd/Add/Sub/Max, MNNMaxFloat/MinFloat (#3779, #3913, #4036)
  • Convolution: MNNConvRunForLineDepthwise, MNNDeconvRunForUnitDepthWise (#4042)
  • Activation: MNNSoftmax, MNNReluWithSlopeChannel (#4044)
  • Interpolation: CPUBilinearSampleC4, MNNCubicSampleC4/C16 (#4053)
  • Image Processing: MNNBGRAToBGR, MNNC3ToHSV/XYZ/YUV, MNNNV21ToRGB (#4067, #4079)
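
To give a rough idea of the loop shape behind element-wise kernels such as the Matrix Ops group, here is a plain-Python model of the strip-mining pattern that RISC-V Vector code typically follows. It is only an illustrative sketch, not the MNN implementation: `strip_mined_add` and `vlmax` are hypothetical names, and the `min(remaining, vlmax)` line models only the simplest behaviour of the `vsetvl` instruction.

```python
def strip_mined_add(a, b, vlmax=8):
    """Element-wise a + b, processed in chunks the way an RVV loop would be.

    vlmax is a hypothetical stand-in for the hardware's maximum vector length
    (in elements); real code obtains the chunk size from vsetvl instead.
    """
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    out = [0.0] * len(a)
    i = 0
    remaining = len(a)
    while remaining > 0:
        # Simplified model of vsetvl: take as many elements as fit in one vector.
        vl = min(remaining, vlmax)
        # One "vector instruction": load vl elements of each input, add, store.
        out[i:i + vl] = [x + y for x, y in zip(a[i:i + vl], b[i:i + vl])]
        i += vl
        remaining -= vl
    return out


if __name__ == "__main__":
    print(strip_mined_add([1.0, 2.0, 3.0], [4.0, 5.0, 6.0], vlmax=2))
```

Because the chunk size is recomputed on every iteration, the same loop handles any input length without a separate scalar tail, which is the main appeal of the pattern.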

🤖 vLLM - RISC-V Platform Adaptation

Contributed 8 merged PRs to enable vLLM on RISC-V:

| PR | Description |
| --- | --- |
| #24951 | Add OpenMP detection logic |
| #25816 | Add RISC-V to chunked detection exclusion list |
| #26228 | Force float32 on RISC-V for model compatibility |
| #26401 | Fix nix build & add local oneDNN support |
| #26693 | Disable torch compile on RISC-V |
| #28847 | Refactor with unroll_loop for better performance |
| xgrammar#458 | Port xgrammar library to RISC-V |
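
As a rough illustration of the kind of platform check involved in changes like "Force float32 on RISC-V", the sketch below detects the architecture with the standard library and falls back to float32 for half-precision requests. This is not vLLM's actual code: `is_riscv` and `pick_dtype` are hypothetical helpers, and the dtype names are plain strings chosen for the example.

```python
import platform


def is_riscv(machine=None):
    """Return True if the machine string names a RISC-V architecture.

    Linux on RISC-V reports values such as "riscv64" from platform.machine().
    """
    machine = (machine or platform.machine()).lower()
    return machine.startswith("riscv")


def pick_dtype(requested, machine=None):
    """Hypothetical policy: use float32 for half-precision requests on RISC-V."""
    if is_riscv(machine) and requested in ("float16", "bfloat16"):
        return "float32"
    return requested


if __name__ == "__main__":
    print(platform.machine(), pick_dtype("bfloat16"))
```

Passing `machine` explicitly keeps the policy testable on any host, while the default path reads the real architecture at run time.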

πŸ› οΈ Skills

RISC-V C++ Python


📫 Contact

Pinned

  1. MNN (Public)

    Forked from alibaba/MNN

    MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

    C++

  2. vllm (Public)

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  3. ncnn (Public)

    Forked from Tencent/ncnn

    ncnn is a high-performance neural network inference framework optimized for the mobile platform

    C++