Data collections and collection references by bholmesdev · Pull Request #569 · withastro/roadmap

bholmesdev · 2023-05-02T02:17:26Z

Summary

This introduces two related features to content collections:

Data collections: A way to store JSON files as collections
References: A way to reference collection entries from another collection

Links

Fully rendered proposal 1 - data collections
Fully rendered proposal 2 - collection references
Stage 2 proposal: Referencing data from content collections #530
Stage 1 proposals: Data collections #477, Support for relational data in content collections #525

Visual.Studio.Code.-.config.ts.with-data.-.2.May.2023.mp4

proposals/0034-collection-references.md

FredKSchott · 2023-05-02T20:22:18Z

proposals/0033-data-collections.md

+- **We'd need to parse the whole array of entries** to determine entry IDs. This could be a performance bottleneck vs. pulling IDs from file names.
+- **It would be different from content collections,** which means a learning curve.
+
+Due to these, we decided against single-file for now. Though we do recognize the convenience of colocation that can be explored in the future.


+1 to exploring in the future. Here's a solution I'd love to see explored more in a future RFC, one that I think would kill a few birds with one stone:

    + import authorsData from '../authors.json';
      const authors = defineCollection({
        type: 'data',
        schema: z.object({
          name: z.string(),
          twitter: z.string().url(),
        }),
    +   data: authorsData,
      });

Just a thought, and not at all blocking this RFC. I agree it's smart to treat this as out of scope!

What's the use case for importing like this? It won't work with remote data; I'm surprised at the suggestion.

@matthewp I assume this could allow users to parse data in any format they choose, or even write data in-line with their config. The downside: it would be possible for data to exist outside of content/, which may not feel consistent. Not sure I mind that inconsistency though.
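The performance point in the quoted proposal text (IDs pulled from file names versus IDs parsed out of one array) can be illustrated with a small standalone sketch. The function names and the single-file shape (an array of objects with an `id` field) are assumptions made for illustration; the RFC does not define a single-file format.

```typescript
// One file per entry: IDs come from the directory listing alone,
// so no file contents need to be read or parsed.
function idsFromFileNames(fileNames: string[]): string[] {
  return fileNames.map((name) => name.replace(/\.json$/, ''));
}

// Hypothetical single-file layout: the whole array has to be parsed
// before any ID is known.
function idsFromSingleFile(fileContents: string): string[] {
  const entries = JSON.parse(fileContents) as Array<{ id: string }>;
  return entries.map((entry) => entry.id);
}

const perFileIds = idsFromFileNames(['ben-holmes.json', 'fred-schott.json']);
const singleFileIds = idsFromSingleFile(
  '[{"id":"ben-holmes","name":"Ben"},{"id":"fred-schott","name":"Fred"}]'
);
console.log(perFileIds, singleFileIds);
```

Both functions produce the same IDs here; the difference is that the second one must read and parse every entry's data up front.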

proposals/0033-data-collections.md

FredKSchott · 2023-05-02T20:26:21Z

proposals/0033-data-collections.md

+</ul>
+```
+
+To retrieve individual entries, `getEntry()` can be used. This receives both the collection name and the entry `id` as described in the [Return type](#return-type). These can be passed as separate arguments or as object keys.


Huge fan of this new getEntry API! It's a clever solution to the ID vs. slug naming complexity you outline below.
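The two call shapes described in the quoted proposal text (separate arguments or object keys) could be accepted by a single function through TypeScript overloads. This is a minimal standalone sketch, not Astro's implementation; `EntryRef` and `normalizeEntryArgs` are hypothetical names used only for illustration.

```typescript
// Hypothetical shape for illustration; not Astro's real types.
interface EntryRef {
  collection: string;
  id: string;
}

// Overloads: accept either (collection, id) or a single { collection, id } object.
function normalizeEntryArgs(collection: string, id: string): EntryRef;
function normalizeEntryArgs(ref: EntryRef): EntryRef;
function normalizeEntryArgs(collectionOrRef: string | EntryRef, id?: string): EntryRef {
  if (typeof collectionOrRef === 'string') {
    if (id === undefined) {
      throw new Error('An entry id is required when the collection is passed as a string.');
    }
    return { collection: collectionOrRef, id };
  }
  // Copy only the two keys we care about so extra properties are dropped.
  return { collection: collectionOrRef.collection, id: collectionOrRef.id };
}

const fromArgs = normalizeEntryArgs('authors', 'ben-holmes');
const fromObject = normalizeEntryArgs({ collection: 'authors', id: 'ben-holmes' });
console.log(fromArgs, fromObject);
```

Normalizing both shapes to one internal reference keeps the lookup logic identical regardless of how the caller wrote the call.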

FredKSchott · 2023-05-02T20:28:49Z

proposals/0033-data-collections.md

+export const collections = { authors };
+```
+
+These collections can also be queried using the `getCollection()` and `getEntry()` utilities:


You mention getEntries() in the video, but I did a quick search on the page and didn't see that here. Is that included or not? If included, it should be defined here!

Good point! We should add that 👍

Update: added to the references doc. getEntries() is meant for resolving references primarily and shouldn't be recommended for manual use. This is how a manual call would look:

const relatedPosts = await getEntries({ collection: 'blog', slug: 'post-1' }, { collection: 'blog', slug: 'post-2' }, ...);

In fact, we can document "if you want to retrieve multiple entries manually, we suggest getCollection() with a filter."
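The "getCollection() with a filter" suggestion boils down to a predicate over entries. The sketch below runs standalone against an in-memory array; `BlogEntry`, `allPosts`, and `isRelated` are hypothetical stand-ins, and in an Astro project the same predicate would presumably be passed as the filter argument to `getCollection('blog', ...)` instead of `Array.prototype.filter`.

```typescript
// Hypothetical entry shape for illustration only.
interface BlogEntry {
  slug: string;
  data: { title: string };
}

// Stand-in for the entries a collection query would return.
const allPosts: BlogEntry[] = [
  { slug: 'post-1', data: { title: 'First post' } },
  { slug: 'post-2', data: { title: 'Second post' } },
  { slug: 'post-3', data: { title: 'Third post' } },
];

// The slugs we want to retrieve manually.
const wantedSlugs = new Set(['post-1', 'post-2']);

// Predicate that keeps only the wanted entries.
const isRelated = (entry: BlogEntry): boolean => wantedSlugs.has(entry.slug);

const relatedPosts = allPosts.filter(isRelated);
console.log(relatedPosts.map((post) => post.slug));
```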

ematipico

First round of questions about data collections

proposals/0033-data-collections.md

tony-sull · 2023-05-09T20:47:23Z

proposals/0033-data-collections.md

+});
+```
+
+Then, we will update our type generator to recognize these data-specific file extensions. This should also raise errors when collections are misconfigured (i.e. `type: 'data'` is missing from the config file) and when a mix of content and data in the same collection is detected.


"This should also raise errors ... a mix of content and data in the same collection is detected."

I could see this being a bit of a blocker for the easy upgrade path of a unified src/content directory.

If I have a large data collection and only need to upgrade one entry to use body content for some specific use case, I'd end up having to update every item in the collection to .md files.

I can't think of a great example of this at the moment, so it may very well be a corner case. Feel free to ignore if it isn't a use case worth covering!

Fair enough! There may be a case for allowing content inside data collections, since data collections have a subset of content collections' properties. It sounds like you're proposing the other way around, though. In that case, we could either mix return types based on each entry's file extension, or allow data entries in content collections. If we allow data entries in content collections, properties like render() and body would be added to every collection item and stubbed out with undefined, which I worry about for usability. If we mix return types instead, the types you get back from getCollection() become riskier to use, and we'd need to ship some type guards for appropriate filtering.

My thinking is this:

1. Move forward with collection-level types for launch.
2. Consider allowing content collections in data collections as a fast follow.
3. Look into the "singleton" discussion here (Singletons in the Content Collections API #449) for one-off .md files as you're describing. I think there's a use case for allowing one-off entries with a shared schema, though it's something we could tackle separately.

Let me know if that makes sense, or if I'm missing anything!
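To make the type-guard idea above concrete, here is a minimal standalone sketch of how a mixed collection could be narrowed. The `DataEntry`, `ContentEntry`, and `isContentEntry` names and shapes are assumptions for illustration; they are not types that Astro ships.

```typescript
// Hypothetical entry shapes: a content entry has everything a data entry has,
// plus a slug and a body.
interface DataEntry {
  id: string;
  collection: string;
  data: Record<string, unknown>;
}

interface ContentEntry extends DataEntry {
  slug: string;
  body: string;
}

type MixedEntry = DataEntry | ContentEntry;

// User-defined type guard: narrows a mixed entry to a content entry.
function isContentEntry(entry: MixedEntry): entry is ContentEntry {
  return 'body' in entry && typeof entry.body === 'string';
}

const mixed: MixedEntry[] = [
  { id: 'ben-holmes', collection: 'authors', data: { name: 'Ben' } },
  { id: 'intro.md', collection: 'authors', data: { name: 'Intro' }, slug: 'intro', body: '# Hello' },
];

// Array.prototype.filter accepts a type guard, so the result is ContentEntry[].
const contentOnly: ContentEntry[] = mixed.filter(isContentEntry);
console.log(contentOnly.map((entry) => entry.slug));
```

Without a guard like this, every caller would have to check for `body` and `slug` by hand before using them, which is the usability cost raised in the comment.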

Given that the RFC is focused on adding support for data collections and a migration path from data entries to content entries isn't a specific goal, I think we can save this question/idea for later!

I honestly don't have a great argument for a solution there. It'll be interesting to see how data collections get used and whether mixing content and data entries in the same collection is even a common enough use case to consider 👍

Yep, another case of "defensively make restrictive, open up if people want it" 👍

bholmesdev · 2023-05-10T15:49:09Z

Thanks for your contributions everyone! We're looking to reach consensus on this RFC by end of the week, so please share any final feedback (blocking or otherwise) that you want us to address. If we don't hear anything, that's our go-ahead to ship 🚢

bholmesdev · 2023-05-12T20:05:41Z

Well, I'll take your stunned silence as a "ship it!" Merging

bholmesdev added 5 commits May 1, 2023 15:12

new: summary, example, background, goals

593908b

chore: gitignore

49d3a0b

new: detailed design, drawbacks, alternatives, adoption

a8dae6d

new: collection references draft

d89e2a4

chore: link

01cd403

bholmesdev mentioned this pull request May 2, 2023

Data collections and references withastro/astro#6850

Merged

FredKSchott reviewed May 2, 2023

proposals/0034-collection-references.md

FredKSchott reviewed May 2, 2023

proposals/0033-data-collections.md

FredKSchott reviewed May 2, 2023

bholmesdev added 3 commits May 8, 2023 10:25

edit: allow reference() on schema function

3c17eff

edit: update to JSON or YAML

04a9e33

edit: add getEntries() to references

c5363ba

ematipico reviewed May 8, 2023

proposals/0033-data-collections.md

bholmesdev and others added 5 commits May 8, 2023 12:36

fix: typo "single0file"

3d11143

Co-authored-by: Emanuele Stoppa <my.burning@gmail.com>

new: drawbacks, alts, adoption strat

82fa785

Merge branch 'data-collections' of github.com:withastro/rfcs into dat…

50899d1

…a-collections

chore: remove now resolved question

819260f

edit: clarify ids with the same filename are not allowed

045bd49

bholmesdev marked this pull request as ready for review May 8, 2023 23:10

tony-sull reviewed May 9, 2023

chore: remove second reference() import option

bfb1be4

bholmesdev mentioned this pull request May 11, 2023

Data collections and collection references withastro/docs#3233

Merged

bholmesdev merged commit c7eb6d0 into main May 12, 2023

bholmesdev deleted the data-collections branch May 12, 2023 20:05

6 participants