ggml: WebGPU backend host improvements and style fixing#14978
ggml: WebGPU backend host improvements and style fixing#14978reeselevine merged 9 commits intoggml-org:masterfrom
Conversation
|
Looks like one test is failing, I'll investigate that. |
This comment was marked as spam.
This comment was marked as spam.
|
A couple updates:
Going forward, I'll roll my own releases of Dawn when necessary, and once the Dawn folks are able to get their CI working again. Otherwise the code here is ready for review/merging. |
ggerganov
left a comment
There was a problem hiding this comment.
Going forward, I'll roll my own releases of Dawn when necessary, and once the Dawn folks are able to get their CI working again.
Sounds good 👍
|
@reeselevine It looks like the CI is stalling after a crash due to missing SET_ROWS support. |
|
@CISC yeah I noticed that after it was merged 😞. Simplest solution is to disable set rows for now, I have a commit ready: reeselevine@ae8edbf But I also am working on adding support for SET_ROWS. If I get that done today I'll open a PR with support for it, otherwise I'll open a quick PR with that fix. |
* Add parameter buffer pool, batching of submissions, refactor command building/submission * Add header for linux builds * Free staged parameter buffers at once * Format with clang-format * Fix thread-safe implementation * Use device implicit synchronization * Update workflow to use custom release * Remove testing branch workflow
This PR adds a few improvements to the host-side setup for WebGPU, which should make it easier to add more operations/improve performance: