Skip to content

duplicates of site.data in db.json leads to large size #5260

@EmptyDreams

Description

@EmptyDreams

Check List

Please check followings before submitting a new issue.

Expected behavior

The db.json file should be a reasonable size. (Should it be under 50MB at the very least?)

Actual behavior

But in fact, even though I deleted most of the posts from my blog and kept only one post (the md file was 5KB in size), the resulting db.json was still over 70MB.

I'm not quite clear on why this happened, I didn't have this problem originally either, it seemed to come out of nowhere but I can't remember what I was modifying at the time.

Through my observation, there are a lot of duplicate _id & data fields in db.json, such as the source/_data/avatar/cpen.webp file, which is recorded more than 100 times in db.json, which takes up a lot of space.

This problem caused my hexo se and hexo g to wait quite a long time after the INFO Validating config step, and I had to run hexo cl or manually delete the db.json before each command, which clearly defeats the purpose of the db.json file.

Is the problem still there under "Safe mode"?

The problem persists even if I add the --safe directive.

Environment & Settings

Node.js & npm version(node -v && npm -v)

node version: 18.12.1

npm version: 9.8.1

Your site _config.yml (Optional)

Hexo and Plugin version(npm ls --depth 0)

Details

+-- @neilsustc/markdown-it-katex@1.0.0
+-- cheerio@1.0.0-rc.12
+-- gulp-clean@0.4.0
+-- gulp-cssnano@2.1.3
+-- gulp-html-minifier-terser@7.1.0
+-- gulp-htmlclean@2.7.22
+-- gulp-terser@2.1.0
+-- gulp@4.0.2
+-- hexo-abbrlink@2.2.1
+-- hexo-asset-image@1.0.0
+-- hexo-butterfly-envelope@1.0.15
+-- hexo-deployer-git@3.0.0
+-- hexo-filter-nofollow@2.0.2
+-- hexo-generator-archive@2.0.0
+-- hexo-generator-baidu-sitemap@0.1.9
+-- hexo-generator-category@2.0.0
+-- hexo-generator-feed@3.0.0
+-- hexo-generator-index@3.0.0
+-- hexo-generator-sitemap@3.0.1
+-- hexo-generator-tag@2.0.0
+-- hexo-graphviz@1.0.2
+-- hexo-log@3.2.0
+-- hexo-renderer-ejs@2.0.0
+-- hexo-renderer-markdown-it@7.1.0
+-- hexo-renderer-pug@3.0.0
+-- hexo-renderer-stylus@3.0.0
+-- hexo-server@3.0.0
+-- hexo-swpp@2.8.10
+-- hexo-wordcount@6.0.1
+-- hexo@6.3.0
+-- node-fetch@2.6.9
`-- prismjs@1.29.0

Your package.json package.json

detail

{
  "name": "hexo-site",
  "version": "0.0.0",
  "private": true,
  "scripts": {
    "build": "hexo generate",
    "clean": "hexo clean",
    "deploy": "hexo deploy",
    "server": "hexo server"
  },
  "hexo": {
    "version": "6.3.0"
  },
  "dependencies": {
    "@neilsustc/markdown-it-katex": "^1.0.0",
    "cheerio": "^1.0.0-rc.12",
    "hexo": "^6.3.0",
    "hexo-abbrlink": "^2.2.1",
    "hexo-asset-image": "^1.0.0",
    "hexo-butterfly-envelope": "^1.0.15",
    "hexo-deployer-git": "^3.0.0",
    "hexo-filter-nofollow": "^2.0.2",
    "hexo-generator-archive": "^2.0.0",
    "hexo-generator-baidu-sitemap": "^0.1.9",
    "hexo-generator-category": "^2.0.0",
    "hexo-generator-feed": "^3.0.0",
    "hexo-generator-index": "^3.0.0",
    "hexo-generator-sitemap": "^3.0.1",
    "hexo-generator-tag": "^2.0.0",
    "hexo-graphviz": "^1.0.2",
    "hexo-log": "^3.0.0",
    "hexo-renderer-ejs": "^2.0.0",
    "hexo-renderer-markdown-it": "^7.1.0",
    "hexo-renderer-pug": "^3.0.0",
    "hexo-renderer-stylus": "^3.0.0",
    "hexo-server": "^3.0.0",
    "hexo-swpp": "^2.8.10",
    "hexo-wordcount": "^6.0.1",
    "node-fetch": "^2.6.9",
    "prismjs": "^1.29.0"
  },
  "devDependencies": {
    "gulp": "^4.0.2",
    "gulp-clean": "^0.4.0",
    "gulp-cssnano": "^2.1.3",
    "gulp-html-minifier-terser": "^7.1.0",
    "gulp-htmlclean": "^2.7.22",
    "gulp-terser": "^2.1.0"
  }
}

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions