Shared extractor: support file path globs #13969
Open
+230
−98
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Update the tree-sitter extractor to support file globs. This replaces the existing
file_extensionsfield with afile_globsfield, which supports UNIX style glob patterns powered by theglobsetcrate.This allows files with no extension (e.g. Dockerfiles) to be extracted
by specifying a glob such as
*Dockerfile.One surprising aspect of this change is that the globs match against the
whole path, rather than just the file name. I'm not sure if this is an issue we should work around, or if it's OK.
@aibaars I'd be interested in your thoughts on this.
Fixes #13964