Skip to content

feat: add detection for doc, xls, ppt#5

Merged
Borewit merged 1 commit intomasterfrom
ajay/doc-xls-ppt-detection
Jan 6, 2026
Merged

feat: add detection for doc, xls, ppt#5
Borewit merged 1 commit intomasterfrom
ajay/doc-xls-ppt-detection

Conversation

@ajayvignesh01
Copy link
Collaborator

Implements CFBF detection for legacy Office formats (.doc, .xls, .ppt)

This adds proper file type detection by:

  • Validating the CFBF magic signature (D0 CF 11 E0...)
  • Parsing the header to locate the directory sector
  • Reading the root entry CLSID to identify the specific Office format

Supports Word 97-2003 (.doc), Excel 97-2003 (.xls with 2 CLSID variants), and PowerPoint 97-2003 (.ppt) with correct MIME types. The CLSID-based approach can be easily extended to support other CFBF formats like .msg, .msi, or .vsd in the future.

@Borewit Borewit merged commit 1383a64 into master Jan 6, 2026
8 checks passed
@Borewit Borewit deleted the ajay/doc-xls-ppt-detection branch January 6, 2026 17:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants