-
Notifications
You must be signed in to change notification settings - Fork 159
Comparing changes
Open a pull request
base repository: dmlc/dlpack
base: v1.1
head repository: dmlc/dlpack
compare: v1.2
- 10 commits
- 5 files changed
- 10 contributors
Commits on Mar 13, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 22eb042 - Browse repository at this point
Copy the full SHA 22eb042View commit details
Commits on Mar 14, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 2541d71 - Browse repository at this point
Copy the full SHA 2541d71View commit details
Commits on Mar 25, 2025
-
chore: fix typos in dlpack.h comments and notes (#161)
- Correct 'multiply' to 'multiple' in data alignment comment. - Correct 'aligment' to 'alignment' in data alignment comment. - Fix typo 'in in' to 'in' in DLManagedTensorVersioned comment.
Configuration menu - View commit details
-
Copy full SHA for 83af1e7 - Browse repository at this point
Copy the full SHA 83af1e7View commit details
Commits on Mar 31, 2025
-
Update CMakeLists.txt: change min cmake version to 3.5 (#163)
Changming Sun authoredMar 31, 2025 Configuration menu - View commit details
-
Copy full SHA for 490e31b - Browse repository at this point
Copy the full SHA 490e31bView commit details
Commits on Apr 10, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 688d2ba - Browse repository at this point
Copy the full SHA 688d2baView commit details
Commits on May 12, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 5c210da - Browse repository at this point
Copy the full SHA 5c210daView commit details
Commits on Jun 10, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 7f393bb - Browse repository at this point
Copy the full SHA 7f393bbView commit details
Commits on Sep 4, 2025
-
Use commit hash to download lint script (#171)
Since it was deleted in dmlc/dmlc-core#693
Configuration menu - View commit details
-
Copy full SHA for addbc8b - Browse repository at this point
Copy the full SHA addbc8bView commit details
Commits on Oct 11, 2025
-
[RFC] Support DLPACK C Functions for Speed Exchange and Stream Handli…
…ng (#174) * [RFC] Support DLPACK C Functions for Speed Exchange and Stream Handling This PR adds support for three C functions to speedup DLPack exchange. As of now, DLPack exchange relies on python functions such as tensor.__dlpack__(). While they works well for common cases, the general overhead of such exchange is at the level of 0.2-0.3 us for very well optimized version, and can go up to 0.4-1 us for less optimized implementation. For a function that takes three arguments f(a, b, c), assume we run DLPack exchange for each argument, the general conversion overhead usually gets to around 1us and sometimes to 3us. While such overhead can be acceptable in many settings, in GPU applications the extra 1-3us overhead can still be significant. This PR proposes three functions for speed exchange DLPack tensors without going through python interpreter. - DLPackFromPyObject: exports a PyObject Tensor to DLManagedTensorVesioned - DLPackToPyObject: DLManagedTensorVesioned converts to a PyObject Tensor - DLPackTensorAllocator: Used to expose one package's tensor allocator to another package - This allows for example we implement libraries that allocates intermediate tensor based on the caller's specified Tensor Allocator. Our preliminary results show that these functions, when incorporated correctly via native extensions such as c/c++, can bring exchange cost to the level of 30ns - 80ns, giving us about one order of maginitude speedup. That means functions like f(a, b, c) can finish at 0.2us-0.4us level, which is close to what native cpp extension overhead do without exchange. * Incorporate suggestions to use a global table for all function pointers * Update naming and specification according to feedbacks * Add maximum version parameter * Update to include WorkStream proposal, move version to API chain * Update based on latest feedback * Incorporate feedbacks from sberg Co-authored-by: Sebastian Berg <sebastianb@nvidia.com> * Update include/dlpack/dlpack.h Co-authored-by: Sebastian Berg <sebastian@sipsolutions.net> * Update include/dlpack/dlpack.h Co-authored-by: Sebastian Berg <sebastian@sipsolutions.net> * Move exchange api to common header * Update include/dlpack/dlpack.h Co-authored-by: Lisandro Dalcin <dalcinl@gmail.com> --------- Co-authored-by: Sebastian Berg <sebastianb@nvidia.com> Co-authored-by: Sebastian Berg <sebastian@sipsolutions.net> Co-authored-by: Lisandro Dalcin <dalcinl@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 1117366 - Browse repository at this point
Copy the full SHA 1117366View commit details -
Enforce strides to be not null when ndim is nonzero (#178)
* Enforce strides to be not null when ndim is nonzero This PR updates the spec to enforce the strides to be not null when the ndim is nonzero. This is going to help consumers to handle cases more uniformly. * Add more comments on the null rationale * Remove mention of framework handling * Update NULL note, bump minor
Configuration menu - View commit details
-
Copy full SHA for 93c8f2a - Browse repository at this point
Copy the full SHA 93c8f2aView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v1.1...v1.2