fix: typesense doc sync #11807
- Refactor workflow to delete temp collections.
- Delete unused facets from docsearch scraper config.
gioboa
left a comment
Thanks @G0maa it looks fine to me. Let's try this step in the main pipeline. 💪
alumni
left a comment
LGTM, but I'm not familiar with Typesense and our account setup. @dlhck and @naorpeled would be the best candidates to review :)
Nice @G0maa, seems the indexing is now running without failing 👏🏻

Does anyone have any idea how the docs are published? Seems that on typeorm.io the docs stopped being published sometime after v0.3.27. Is that right?

Also: is Typesense indexing our markdown files or the website?

Maybe in the next meeting we can discuss docs too - we should talk about publishing + indexing for 2-3 versions (0.3, master and next).
For now 🙈
Sadly no idea.
Based on these lines it's indexing the website. I think I saw it somewhere that Typesense supports indexing docusaurus directly (but will have to verify).
Totally agree.
gioboa
left a comment
typesense-docsearch-scraper is the command for indexing the website. We have it in the workflow.
In the latest run there is an out-of-memory error as well.
That's not good 😅
@gioboa I noticed that, but it's in the part where it displays the recent logs (so I thought that log might be from the previous runs, but I might be wrong).
I ran the task again, same error. :(
Hmmmm... I'll try to figure out a workaround for this. I believe we shouldn't need to increase memory, TypeORM docs aren't that massive. Going through Typesense docs there's a way to limit the
@G0maa that might actually be a good idea, since we go OOM, not out of disk space. Some of the pages are long (but still, they should not take MBs).
Co-authored-by: Giorgio Boa <35845425+gioboa@users.noreply.github.com>

Description of change
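Currently the Typesense index workflow fails a lot due to limited memory. This PR helps optimize memory usage a bit by deleting temp collections in the workflow and removing unused facets from the docsearch scraper config.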
Fixes #11799
I've tested this on my own free-tier Typesense cloud deployment and it worked, but please note that this is not guaranteed to fix `OUT_OF_MEMORY` issues; we may eventually need to increase memory 😄
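
For readers unfamiliar with the "delete temp collections" step, here is a rough sketch of the idea in Python. It is not the workflow code from this PR. It assumes the scraper leaves behind collections whose names share a common prefix and that the live collection is the one an alias points to. The prefix and the helper names (`stale_collections`, `delete_stale`) are hypothetical; the endpoints used (`GET /collections`, `GET /aliases`, `DELETE /collections/{name}`) and the `X-TYPESENSE-API-KEY` header come from the Typesense HTTP API.

```python
import json
import urllib.request


def stale_collections(collection_names, aliased_names, prefix):
    """Return prefixed collections that no alias points to (hypothetical helper)."""
    keep = set(aliased_names)
    return sorted(
        name
        for name in collection_names
        if name.startswith(prefix) and name not in keep
    )


def _request(base_url, api_key, method, path):
    # Minimal Typesense HTTP call; returns the decoded JSON response.
    req = urllib.request.Request(
        f"{base_url}{path}",
        method=method,
        headers={"X-TYPESENSE-API-KEY": api_key},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def delete_stale(base_url, api_key, prefix):
    """Delete every prefixed collection that is not the target of an alias."""
    collections = [
        c["name"] for c in _request(base_url, api_key, "GET", "/collections")
    ]
    aliased = [
        a["collection_name"]
        for a in _request(base_url, api_key, "GET", "/aliases")["aliases"]
    ]
    for name in stale_collections(collections, aliased, prefix):
        _request(base_url, api_key, "DELETE", f"/collections/{name}")
```

Keeping the selection logic in a pure function makes it easy to check which collections would be removed before any `DELETE` request is sent.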