Create subsets of pyarrow package with pyarrow-core < pyarrow < pyarrow-all and update to Arrow v16.0.0 #1255
raulcd wants to merge 0 commits into conda-forge:main
|
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( |
I had thought the idea was to just let |
From the discussions and the conversations I've had with @jorisvandenbossche I thought we wanted to provide both Currently the error on I see your point and I am OK with both approaches; I would probably agree that 2 is the better approach for the future:
@jorisvandenbossche what are your thoughts? |
|
I haven't yet looked in detail, but some quick drive-by thoughts on the package naming:
A potential alternative is something in between:
|
Currently we can't import pyarrow without libparquet: I've opened apache/arrow#39006 |
I think some degree of breakage is unavoidable if we want to introduce a minimal pyarrow and default to it at some point. The question is whether the error messages that people encounter would be self-explanatory enough to trivially fix things.
If we really want to have not just It wouldn't be the first package with such a setup though, for example ray has So I guess it'd be reasonable to do |
|
Hi guys! So how's this progressing? 👍 |
I am currently working on releasing Arrow 15.0.0 which includes a fix to be able to extract |
c5a9ac2 to ecf8418
|
Hi! This is the friendly automated conda-forge-linting service. I wanted to let you know that I linted all conda-recipes in your PR ( Here's what I've got... For recipe:
|
|
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( |
|
@jorisvandenbossche @h-vetinari I have rebased this PR with 15.0.0 so
I think this is good for a review at the moment. |
7f62f77 to 5958885
|
@jorisvandenbossche @raulcd I think we need to have a discussion about this on the Arrow mailing lists before we push this change. The Arrow community currently has little awareness that this change is planned. I worry that there could be much surprise and frustration if we do not inform the community in advance. |
|
@ianmcook I think this is a valid concern but the only affected users will be those using We could add a patch like:
diff --git a/python/pyarrow/flight.py b/python/pyarrow/flight.py
index b183690..12eef59 100644
--- a/python/pyarrow/flight.py
+++ b/python/pyarrow/flight.py
@@ -65,5 +65,6 @@ try:
     )
 except ImportError as exc:
     raise ImportError(
-        f"The pyarrow installation is not built with support for 'flight' ({str(exc)})"
+        "The 'pyarrow' installation is not built with support for " +
+        f"'flight'. Please install the conda-forge 'pyarrow-all' package. ({str(exc)})"
     ) from None
and users will get prompted something like: to be even more explicit. We can send an email to the mailing list in order to give a heads up but in my opinion we could merge. We did the split of |
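For readers following the thread, the pattern in the patch above can be sketched in isolation. This is a minimal illustration, not code from Arrow or from this feedstock: the helper name `import_optional` and its parameters are hypothetical, and the error text only loosely mirrors the proposed wording.

```python
import importlib


def import_optional(module_name, package_hint):
    """Import an optional module, or raise an ImportError that names a package to install.

    `import_optional` is a hypothetical helper written for illustration only;
    it is not part of pyarrow.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        # `from None` hides the original traceback, as in the patch above,
        # while the original message is still kept in parentheses.
        raise ImportError(
            f"'{module_name}' is not available in this installation. "
            f"Please install the '{package_hint}' package. ({exc})"
        ) from None
```

A caller could then write `flight = import_optional("pyarrow.flight", "pyarrow-all")` and, if the Flight bindings are missing, get an error that points at the package to install rather than a bare import failure.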
|
Thanks @raulcd — a message to the user@ and dev@ mailing lists would be great. The patch to give a more helpful error message would also be great. |
5958885 to 3c6cc03
|
@h-vetinari I was having a chat with @xhochy at PyCon DE and decided to add the Arrow update to v16.0.0 in the same PR so there is not a build without the split. I hope this is also good with you. |
|
That's exactly how I would have suggested doing it! :) |
2999bbe to 6e21a4f
|
It seems we hit this issue: protocolbuffers/protobuf#14576 |
h-vetinari left a comment
This is getting close but still needs a bit of work. I also want to do #1375 first to uncouple the aws-migrations from the arrow bump. I'm happy to rebase this PR on top afterwards (commit history needs some cleanup as well), and apply the necessary clean-ups.
Also, this hasn't been rerendered yet for the new outputs, which is going to further increase the rerender time and effectively prevent the bot from opening new migrations (see conda-forge/conda-forge-pinning-feedstock#5815). This is not an issue with this PR itself, but something we should fix beforehand, or at the very least ASAP afterwards.
b4195b8 to 031b47d
|
Hi! This is the friendly automated conda-forge-linting service. I was trying to look for recipes to lint for you, but it appears we have a merge conflict. Please ping the 'conda-forge/core' team (using the @ notation in a comment) if you believe this is a bug. |
031b47d to 6166f4c
6166f4c to 24fe27f
|
I completely messed up the rebase that was meant to clean up the commit history, and that closed this PR. I've created a new PR here: #1376 |
|
@raulcd is |
|
The compute kernels themselves ( |
|
(we have an issue about splitting the compute kernels into their own shared library (apache/arrow#25025), which would allow them to be installed separately here as well, but until then I think the compute kernels are all included in the main libarrow.so (or all left out, if one disabled building them))
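To see which optional pieces a given installation actually provides, one can simply probe the imports. The sketch below is an illustration only: `probe_modules` is a hypothetical helper, and which modules end up in which conda package is an assumption that depends on how the split is finally done.

```python
import importlib


def probe_modules(module_names):
    """Return a dict mapping each module name to True if it imports, else False."""
    available = {}
    for name in module_names:
        try:
            importlib.import_module(name)
            available[name] = True
        except ImportError:
            # Also covers ModuleNotFoundError, e.g. when pyarrow itself is absent.
            available[name] = False
    return available


if __name__ == "__main__":
    # These are real pyarrow submodules; whether each one is importable
    # depends on which package variant is installed.
    wanted = ["pyarrow.compute", "pyarrow.parquet", "pyarrow.flight"]
    for name, ok in probe_modules(wanted).items():
        print(f"{name}: {'available' if ok else 'missing'}")
```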
Checklist
0 (if the version changed)
conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
As discussed in #1201, this PR tries to add a new pyarrow-base that only depends on libarrow and libparquet.