<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Dual Lab</title>
	<atom:link href="https://duallab.com/feed/" rel="self" type="application/rss+xml" />
	<link>https://duallab.com</link>
	<description>To connect science and technology</description>
	<lastBuildDate>Thu, 11 Jun 2026 11:33:32 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.9.26</generator>

<image>
	<url>https://duallab.com/wp-content/uploads/2018/03/dl.png</url>
	<title>Dual Lab</title>
	<link>https://duallab.com</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Metadata and PDF accessibility</title>
		<link>https://duallab.com/metadata-and-pdf-accessibility/</link>
		<comments>https://duallab.com/metadata-and-pdf-accessibility/#respond</comments>
		<pubDate>Thu, 11 Jun 2026 11:17:52 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[PDF4WCAG]]></category>
		<category><![CDATA[Products]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7346</guid>
		<description><![CDATA[PDF accessibility is usually associated with tags, headings and alternative text. But there&#8217;s another critical component: metadata. PDF documents may include general...]]></description>
				<content:encoded><![CDATA[<p><a href="https://pdf4wcag.com/blog-news/metadata-and-pdf-accessibility">PDF accessibility</a> is usually associated with tags, headings and alternative text. But there&#8217;s another critical component: <strong>metadata</strong>.</p>
<p>PDF documents may include general information, such as the document’s title, author, and creation and modification dates. Such information about the document (as opposed to its content or structure) is called <strong>metadata</strong> and is intended to assist in cataloguing and searching for documents in external databases.</p>
<p>Metadata plays a major role in modern PDF files, especially in accessibility, document management and AI-based document processing. In PDF files, metadata is commonly stored in an <strong>XMP (Extensible Metadata Platform) package embedded directly in the document</strong>.</p>
<p>&nbsp;</p>
<h3 id="document-title-and-accessibility">Document title and accessibility</h3>
<p>&nbsp;</p>
<p><em>Well-Tagged PDF (WTPDF) declarations are metadata, embedded in PDF 2.0 files within the XMP metadata, that assert a document&#8217;s conformity with <a href="https://pdfa.org/wtpdf/">WTPDF 1.0 requirements</a> for accessibility or content reuse. Developed by the PDF Association, these declarations allow software to identify if a file is optimized for assistive technology (similar to PDF/UA-2) or for structured data extraction.</em></p>
<p>The title helps users understand the purpose of the document before reading its content. Screen readers and other assistive technologies often announce the title when the PDF is opened.</p>
<p>&nbsp;</p>
<p><strong>For example:</strong></p>
<p>“Accessibility Report 2026”<br />
“PDF4WCAG PDF Accessibility Checker”</p>
<p><strong>are significantly more useful than:</strong></p>
<p>“doc.pdf”<br />
“pic001.pdf”</p>
<p><img class=" wp-image-7347 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/image-210x300.png" alt="" width="239" height="341" srcset="https://duallab.com/wp-content/uploads/2026/06/image-210x300.png 210w, https://duallab.com/wp-content/uploads/2026/06/image.png 452w" sizes="(max-width: 239px) 100vw, 239px" /></p>
<h3 id="pdfua-identification-metadata">PDF/UA identification metadata</h3>
<p>&nbsp;</p>
<p>In accessible PDFs, XMP metadata may also contain identification information about conformance standards. Two mechanisms are at work here: one used by PDF/UA, another by WCAG. Both are important, as a document may conform to both PDF/UA and WCAG, as the latest LaTeX-generated Tagged PDFs do.</p>
<p><img class=" wp-image-7348 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/document-title-300x135.png" alt="" width="378" height="170" srcset="https://duallab.com/wp-content/uploads/2026/06/document-title-300x135.png 300w, https://duallab.com/wp-content/uploads/2026/06/document-title-768x346.png 768w, https://duallab.com/wp-content/uploads/2026/06/document-title-1024x461.png 1024w" sizes="(max-width: 378px) 100vw, 378px" /></p>
<p>This metadata allows validators and accessibility tools to determine whether the document claims compliance with standards such as PDF/UA and WCAG.</p>
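<p>As a rough illustration of how a tool might read such a declaration, the sketch below looks for the PDF/UA identification property (<em>pdfuaid:part</em>) in an XMP-like RDF tree. It covers only the PDF/UA mechanism, not the WCAG one. The helper names and the sample tree are assumptions made for this example and are not taken from PDF4WCAG; extracting the XMP packet from a real PDF is out of scope here.</p>

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
PDFUAID_NS = "http://www.aiim.org/pdfua/ns/id/"  # PDF/UA identification schema


def build_sample_pdfua_xmp(part):
    """Build a tiny RDF tree declaring a PDF/UA part (hypothetical sample data)."""
    rdf = ET.Element(f"{{{RDF_NS}}}RDF")
    desc = ET.SubElement(rdf, f"{{{RDF_NS}}}Description")
    ET.SubElement(desc, f"{{{PDFUAID_NS}}}part").text = str(part)
    return rdf


def declared_pdfua_part(rdf):
    """Return the declared PDF/UA part as an int, or None if none is usable."""
    for desc in rdf.iter(f"{{{RDF_NS}}}Description"):
        # XMP allows simple properties as child elements or as attributes.
        value = desc.findtext(f"{{{PDFUAID_NS}}}part") or desc.get(f"{{{PDFUAID_NS}}}part")
        if value and value.strip().isdigit():
            return int(value.strip())
    return None


if __name__ == "__main__":
    print(declared_pdfua_part(build_sample_pdfua_xmp(2)))
```

<p>Note that a declaration found this way is only a claim of conformance; a validator still has to check the document itself.</p>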
<p>&nbsp;</p>
<h3 id="additional-metadata-fields">Additional metadata fields</h3>
<p>&nbsp;</p>
<p>XMP metadata may also contain other valuable document information, including creation and modification dates, author or organization, producer and creator tool, and language information.</p>
<p>Metadata provides assistive technologies with an initial description of the document before content navigation begins. Without proper metadata, accessible PDFs lose important semantic and usability information.</p>
<p><img class=" wp-image-7349 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/xmp-metadata-300x101.png" alt="" width="428" height="144" srcset="https://duallab.com/wp-content/uploads/2026/06/xmp-metadata-300x101.png 300w, https://duallab.com/wp-content/uploads/2026/06/xmp-metadata-768x257.png 768w, https://duallab.com/wp-content/uploads/2026/06/xmp-metadata-1024x343.png 1024w" sizes="(max-width: 428px) 100vw, 428px" /></p>
<h3 id="what-pdf4wcag-checks">What PDF4WCAG checks</h3>
<p>&nbsp;</p>
<p><a href="https://pdf4wcag.com/"><strong>PDF4WCAG</strong></a> checks:</p>
<ul>
<li>dc:title is present and not empty.</li>
<li>The PDF/UA or WCAG compliance declarations, if the document is validated against PDF/UA or WCAG profiles respectively. These declarations are recommended, but not mandatory for WCAG.</li>
<li>The XMP package is properly attached to the document catalog.</li>
</ul>
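<p>The first check in the list above can be sketched in a few lines. This is a minimal illustration, not PDF4WCAG&#8217;s implementation: it assumes the XMP packet has already been extracted and parsed, and the function names and sample data are hypothetical. In XMP, <em>dc:title</em> is stored as a language alternative (an <em>rdf:Alt</em> list of <em>rdf:li</em> items), so the sketch accepts the title if any item contains non-whitespace text.</p>

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_sample_title_xmp(title):
    """Build a tiny RDF tree with a dc:title language alternative (sample data)."""
    rdf = ET.Element(f"{{{RDF_NS}}}RDF")
    desc = ET.SubElement(rdf, f"{{{RDF_NS}}}Description")
    dc_title = ET.SubElement(desc, f"{{{DC_NS}}}title")
    alt = ET.SubElement(dc_title, f"{{{RDF_NS}}}Alt")
    li = ET.SubElement(alt, f"{{{RDF_NS}}}li", {f"{{{XML_NS}}}lang": "x-default"})
    li.text = title
    return rdf


def has_nonempty_title(rdf):
    """Return True if any dc:title entry contains non-whitespace text."""
    path = f".//{{{DC_NS}}}title/{{{RDF_NS}}}Alt/{{{RDF_NS}}}li"
    for li in rdf.iterfind(path):
        if li.text and li.text.strip():
            return True
    return False


if __name__ == "__main__":
    print(has_nonempty_title(build_sample_title_xmp("Accessibility Report 2026")))
    print(has_nonempty_title(build_sample_title_xmp("   ")))
```

<p>A check like this only confirms that a title exists; whether the title is meaningful (as opposed to something like &#8220;doc.pdf&#8221;) still needs human judgment.</p>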
<p><img class="size-medium wp-image-7350 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/xmp-300x107.png" alt="" width="300" height="107" srcset="https://duallab.com/wp-content/uploads/2026/06/xmp-300x107.png 300w, https://duallab.com/wp-content/uploads/2026/06/xmp.png 595w" sizes="(max-width: 300px) 100vw, 300px" /></p>
<p><img class=" wp-image-7351 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/additional-metadata-fields-147x300.png" alt="" width="168" height="343" srcset="https://duallab.com/wp-content/uploads/2026/06/additional-metadata-fields-147x300.png 147w, https://duallab.com/wp-content/uploads/2026/06/additional-metadata-fields.png 369w" sizes="(max-width: 168px) 100vw, 168px" /></p>
<p>&nbsp;</p>
<p><strong>Accessible PDFs</strong> should contain a meaningful <em>dc:title</em>. More advanced workflows should also include standardized identification metadata and descriptive document properties to support both human users and machine processing systems.</p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/metadata-and-pdf-accessibility/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VeraPDF 1.30.2: Patch Release</title>
		<link>https://duallab.com/verapdf-1-30-2-patch-release/</link>
		<comments>https://duallab.com/verapdf-1-30-2-patch-release/#respond</comments>
		<pubDate>Wed, 10 Jun 2026 12:06:38 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[Products]]></category>
		<category><![CDATA[veraPDF]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7340</guid>
		<description><![CDATA[VeraPDF 1.30.2 is out now. This patched release is in accordance with a new security policy adopted by veraPDF. This release comprises the following...]]></description>
				<content:encoded><![CDATA[<p><img class=" wp-image-7341 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-4-300x168.png" alt="" width="420" height="235" srcset="https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-4-300x168.png 300w, https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-4-768x430.png 768w, https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-4-1024x573.png 1024w, https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-4.png 1600w" sizes="(max-width: 420px) 100vw, 420px" /></p>
<p><strong>VeraPDF</strong> 1.30.2 is out now. This patched release is in accordance with <a href="https://github.com/veraPDF/veraPDF-library/blob/integration/SECURITY.md">a new security policy</a> adopted by veraPDF.</p>
<p><strong>This release comprises the following fixes:</strong></p>
<ul>
<li aria-level="1">Secure XML parser for RichText values and XFA</li>
<li aria-level="1">Secure PostScript parser for ToUnicode CMaps and PS Type1 fonts</li>
</ul>
<p>To learn more about the new fixes and features, read <a href="https://github.com/veraPDF/veraPDF-library/releases/tag/v1.30.2">the release notes</a>. To read more about these vulnerabilities, read the advisories <a href="https://github.com/veraPDF/veraPDF-parser/security/advisories/GHSA-7c26-995w-6f47">1</a> <a href="https://github.com/veraPDF/veraPDF-parser/security/advisories/GHSA-jrmc-qg6p-94fp">2</a> <a href="https://github.com/veraPDF/veraPDF-validation/security/advisories/GHSA-cg9x-g3gm-h5h6">3</a> <a href="https://github.com/veraPDF/veraPDF-validation/security/advisories/GHSA-36mm-w85j-3q2j">4</a> <a href="https://github.com/veraPDF/veraPDF-validation/security/advisories/GHSA-3jh7-wm29-q568">5</a>.</p>
<p>These updates continue to develop veraPDF in alignment with the recent <a href="https://digital-strategy.ec.europa.eu/en/policies/cra-open-source">European Cyber Resilience Act</a>, which aims to make devices and software safer by implementing more rigorous cybersecurity, documentation, and vulnerability reporting requirements.</p>
<h3>Download</h3>
<p>Click here to download: <a href="https://software.verapdf.org/releases/1.30/verapdf-greenfield-1.30.2-installer.zip">veraPDF 1.30.2</a></p>
<p>or pull the latest Docker images <a href="https://hub.docker.com/r/verapdf/rest">verapdf/rest</a> and <a href="https://hub.docker.com/r/verapdf/cli">verapdf/cli</a>.</p>
<p>Please report any bugs on the <a href="https://github.com/veraPDF/veraPDF-library/issues">veraPDF issue tracker on GitHub</a> or <a href="https://openpreservation.org/about/contact">contact us</a> with any feedback or questions.</p>
<h3>About veraPDF</h3>
<p><strong>veraPDF</strong> is an open-source PDF/A validator covering all parts of the PDF/A and PDF/UA (<a href="https://pdfa.org/matterhorn-protocol/">Matterhorn Protocol</a> Machine failure conditions) standards. Originally funded by the PREFORMA project, <strong>veraPDF</strong> has been sustained and maintained by the <a href="https://openpreservation.org/">Open Preservation Foundation</a> since 2017. <a href="https://duallab.com/">Dual Lab</a> provides active user support and carries out maintenance and bug fixes. The <a href="https://pdfa.org/">PDF Association</a>’s <a href="https://www.pdfa.org/community/pdf-a-technical-working-group/">PDF/A Technical Working Group</a> continues its role in resolving ambiguities arising from veraPDF’s usage in the field.</p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/verapdf-1-30-2-patch-release/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OpenDataLoader PDF hits #1 on GitHub Trending (Java)!</title>
		<link>https://duallab.com/opendataloader-pdf-hits-1-on-github-trending-java/</link>
		<comments>https://duallab.com/opendataloader-pdf-hits-1-on-github-trending-java/#respond</comments>
		<pubDate>Fri, 05 Jun 2026 05:57:29 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[OpenDataLoader]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7335</guid>
		<description><![CDATA[OpenDataLoader PDF just hit #1 on GitHub Trending (Java) this week! An open-source PDF parser that turns documents into AI-ready data...]]></description>
				<content:encoded><![CDATA[<p><a href="https://github.com/opendataloader-project/opendataloader-pdf?utm_source=x">OpenDataLoader PDF</a> just hit #1 on GitHub Trending (Java) this week!<br />
An <a href="https://opendataloader.org/">open-source PDF parser</a> that turns documents into AI-ready data and automates PDF accessibility along the way.<br />
<strong>+2,000</strong> a week <img src="https://s.w.org/images/core/emoji/11/72x72/2728.png" alt="✨" class="wp-smiley" style="height: 1em; max-height: 1em;" /><br />
<strong>23K+</strong> total <img src="https://s.w.org/images/core/emoji/11/72x72/2728.png" alt="✨" class="wp-smiley" style="height: 1em; max-height: 1em;" /> on GitHub<br />
A huge thank-you to every contributor who made this possible. This milestone is yours!</p>
<p>&nbsp;</p>
<p><img class=" wp-image-7336 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/HKBrYrKa8AAcnKG-206x300.jpg" alt="" width="336" height="489" srcset="https://duallab.com/wp-content/uploads/2026/06/HKBrYrKa8AAcnKG-206x300.jpg 206w, https://duallab.com/wp-content/uploads/2026/06/HKBrYrKa8AAcnKG-768x1119.jpg 768w, https://duallab.com/wp-content/uploads/2026/06/HKBrYrKa8AAcnKG-703x1024.jpg 703w, https://duallab.com/wp-content/uploads/2026/06/HKBrYrKa8AAcnKG.jpg 1078w" sizes="(max-width: 336px) 100vw, 336px" /></p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/opendataloader-pdf-hits-1-on-github-trending-java/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dual Lab releases PDF4WCAG Accessibility Checker 1.10</title>
		<link>https://duallab.com/dual-lab-releases-pdf4wcag-accessibility-checker-1-10/</link>
		<comments>https://duallab.com/dual-lab-releases-pdf4wcag-accessibility-checker-1-10/#respond</comments>
		<pubDate>Thu, 04 Jun 2026 13:38:47 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[PDF4WCAG]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7330</guid>
		<description><![CDATA[Dual Lab announces the release of PDF4WCAG Accessibility Checker 1.10, introducing usability enhancements, expanded localization support, and new document inspection panels. PDF4WCAG is...]]></description>
				<content:encoded><![CDATA[<p><img class=" wp-image-7331 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-2-1-300x168.png" alt="" width="384" height="215" srcset="https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-2-1-300x168.png 300w, https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-2-1-768x430.png 768w, https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-2-1-1024x573.png 1024w, https://duallab.com/wp-content/uploads/2026/06/Dual-Lab-Launches-Quarterly-Reports-on-PDF-Accessibility-Trends-based-on-Common-Crawl-data-2-1.png 1600w" sizes="(max-width: 384px) 100vw, 384px" /></p>
<p><strong>Dual Lab</strong> announces the release of <strong><a href="https://pdf4wcag.com/">PDF4WCAG</a> Accessibility Checker 1.10</strong>, introducing usability enhancements, expanded localization support, and new document inspection panels.</p>
<p><strong>PDF4WCAG</strong> is a professional accessibility validation solution for PDF documents, designed to support compliance with PDF/UA, WCAG, and WTPDF accessibility requirements. It is powered by the <a href="https://verapdf.org/"><strong>veraPDF</strong></a> validation architecture and is identical to <strong>veraPDF</strong> in the machine-verifiable checks of the <strong>PDF/UA and WTPDF validation profiles</strong>.</p>
<h2 id="whats-new-in-version-110">What’s new in Version 1.10</h2>
<p>&nbsp;</p>
<h3>Enhanced localization and user experience</h3>
<p><strong>PDF4WCAG</strong> 1.10 improves interface usability and multilingual support:</p>
<ul>
<li>Redesigned switching between <strong>technical terminology</strong> and <strong>user-friendly language</strong>, providing a more intuitive experience for both accessibility experts (developers) and non-technical users.</li>
<li>Added support for <strong>German</strong> and <strong>Dutch</strong> interface localizations.</li>
</ul>
<p>&nbsp;</p>
<h3>Improved zoom and navigation controls</h3>
<p>Accessibility issue navigation has been refined for better usability:</p>
<ul>
<li>Enhanced zoom behavior for small issue regions and error highlights.</li>
</ul>
<h3>New inspection panels</h3>
<p><strong>PDF4WCAG 1.10</strong> introduces several new analysis panels to provide deeper document insights:</p>
<p><strong>Annotations panel</strong></p>
<p>Inspects PDF annotations, comments, hyperlinks, form controls, and other interactive elements relevant to accessibility and usability evaluation.</p>
<p><strong>Metadata panel</strong></p>
<p>Displays document metadata including:</p>
<ul>
<li>document title</li>
<li>author information</li>
<li>document language</li>
<li>accessibility properties</li>
<li>PDF/UA-related metadata entries</li>
</ul>
<p><strong>Fonts panel</strong></p>
<p>Provides detailed analysis of:</p>
<ul>
<li>embedded fonts</li>
<li>font types and subsets</li>
<li>encoding information</li>
</ul>
<h3>Persistent user preferences</h3>
<p><strong>PDF4WCAG</strong> now preserves user configuration settings between sessions, improving workflow continuity and efficiency. Persisted settings include:</p>
<ul>
<li>selected interface language</li>
<li>active filters</li>
<li>right-side panel state and opened sections</li>
<li>structure tree role map visibility</li>
<li>auto-scaling preferences</li>
</ul>
<h3>CLI enhancements</h3>
<p>The command-line interface has been extended with initial support for additional validation profiles:</p>
<ul>
<li><strong>WCAG Machine</strong></li>
<li><strong>WCAG Machine &amp; Human</strong></li>
</ul>
<p>These profiles are now available under paid commercial licenses on the <a href="https://pdf4wcag.com/licensing/">PDF4WCAG website</a>.</p>
<h3>Public API documentation</h3>
<p>A new public documentation section is now <a href="https://pdf4wcag.com/documentation/local-deployment">available</a>. The API is available under paid commercial licenses on the <a href="https://pdf4wcag.com/licensing/">PDF4WCAG website</a>.</p>
<h3>Integration API Beta testing</h3>
<p>The <a href="https://pdf4wcag.com/documentation/api-references"><strong>PDF4WCAG Integration API</strong></a> is currently in beta testing. The <a href="https://pdf4wcag.com/documentation/api-references">API</a> is designed to simplify integration of accessibility validation workflows into enterprise systems, document processing pipelines, and third-party accessibility platforms.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/dual-lab-releases-pdf4wcag-accessibility-checker-1-10/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>PDF Association Webinar Speaker: Boris Doubrov</title>
		<link>https://duallab.com/boris-doubrov-at-the-webinar-accessible-mathematical-content-in-pdf/</link>
		<comments>https://duallab.com/boris-doubrov-at-the-webinar-accessible-mathematical-content-in-pdf/#respond</comments>
		<pubDate>Wed, 03 Jun 2026 07:06:06 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Team]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7323</guid>
		<description><![CDATA[Boris Doubrov, CEO of Dual Lab, will present at the webinar &#8220;Accessible Mathematical Content in PDF&#8221;. This webinar, offered by the...]]></description>
				<content:encoded><![CDATA[<p>Boris Doubrov, CEO of Dual Lab, will present at the webinar <a href="https://pdfa.org/event/webinar-accessible-mathematical-content-in-pdf/"><strong>&#8220;Accessible Mathematical Content in PDF&#8221;.</strong></a></p>
<p><img class=" wp-image-7324 aligncenter" src="https://duallab.com/wp-content/uploads/2026/06/1780421098905-300x60.jpeg" alt="" width="607" height="121" srcset="https://duallab.com/wp-content/uploads/2026/06/1780421098905-300x60.jpeg 300w, https://duallab.com/wp-content/uploads/2026/06/1780421098905-768x152.jpeg 768w, https://duallab.com/wp-content/uploads/2026/06/1780421098905-1024x203.jpeg 1024w, https://duallab.com/wp-content/uploads/2026/06/1780421098905.jpeg 1280w" sizes="(max-width: 607px) 100vw, 607px" /></p>
<p>This webinar, offered by members of the LaTeX Project LWG, introduces the PDF Association’s recently published <strong>Best Practice Guide: Math in PDF</strong> and takes questions from attendees.</p>
<p>Date &amp; Time: 08:00 PT / 11:00 ET / 17:00 CET</p>
<p><strong><img src="https://s.w.org/images/core/emoji/11/72x72/1f3af.png" alt="🎯" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Topics Covered</strong><br />
<img src="https://s.w.org/images/core/emoji/11/72x72/1f4a2.png" alt="💢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Why we’re doing this: poor user experience when reading math<br />
<img src="https://s.w.org/images/core/emoji/11/72x72/1f4a2.png" alt="💢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Why PDF 2.0 changes everything<br />
<img src="https://s.w.org/images/core/emoji/11/72x72/1f4a2.png" alt="💢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> A (brief!) introduction to MathML<br />
<img src="https://s.w.org/images/core/emoji/11/72x72/1f4a2.png" alt="💢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Relevant standards and the PDF Association’s new guidance<br />
<img src="https://s.w.org/images/core/emoji/11/72x72/1f4a2.png" alt="💢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Implications for publishers and developers</p>
<p>The presentation will take approximately 25-30 minutes and will be followed by a live Q&amp;A.</p>
<p><strong>Presenters</strong></p>
<ul>
<li>David Carlisle, <a href="https://pdfa.org/member/the-latex-project/">LaTeX Project</a> &amp; editor of <a class="extlink https" href="https://www.w3.org/TR/MathML/mathml.html">MathML</a> at <a class="extlink https" href="https://www.w3.org/">W3C</a></li>
<li>Boris Doubrov, <a href="https://pdfa.org/member/dual-lab-sprl/">Dual Lab</a> &amp; chair, <a href="https://pdfa.org/community/latex-project-lwg/">LaTeX Project LWG</a></li>
<li>Frank Mittelbach, <a href="https://pdfa.org/member/the-latex-project/">LaTeX Project</a></li>
<li>Neil Soiffer, creator of <a class="extlink https" href="https://daisy.github.io/MathCAT/" target="_blank" rel="noopener">MathCAT</a>, co-chair of the <a class="extlink https" href="https://www.w3.org/groups/wg/math/">Math WG</a> at <a class="extlink https" href="https://www.w3.org/">W3C</a></li>
</ul>
<p><strong>Registration</strong></p>
<p><a class="extlink https" href="https://zoom.us/webinar/register/WN_--B36VeTTbaW-yu5RLVkhw">Register</a> for the webinar!</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/boris-doubrov-at-the-webinar-accessible-mathematical-content-in-pdf/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is an Artifact in PDF?</title>
		<link>https://duallab.com/what-is-an-artifact-in-pdf/</link>
		<comments>https://duallab.com/what-is-an-artifact-in-pdf/#respond</comments>
		<pubDate>Fri, 29 May 2026 06:33:01 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[PDF4WCAG]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7315</guid>
		<description><![CDATA[PDF artifacts are non-semantic visual elements introduced during document generation, rendering, scanning, or OCR processing. In AI pipelines, these artifacts reduce extraction...]]></description>
				<content:encoded><![CDATA[<p><img class=" wp-image-7316 aligncenter" src="https://duallab.com/wp-content/uploads/2026/05/what-is-an-artifact-300x168.png" alt="" width="395" height="221" srcset="https://duallab.com/wp-content/uploads/2026/05/what-is-an-artifact-300x168.png 300w, https://duallab.com/wp-content/uploads/2026/05/what-is-an-artifact-768x430.png 768w, https://duallab.com/wp-content/uploads/2026/05/what-is-an-artifact-1024x573.png 1024w, https://duallab.com/wp-content/uploads/2026/05/what-is-an-artifact.png 1600w" sizes="(max-width: 395px) 100vw, 395px" /></p>
<p><strong>PDF artifacts</strong> are non-semantic visual elements introduced during document generation, rendering, scanning, or OCR processing. In AI pipelines, these artifacts reduce extraction quality and negatively impact downstream tasks such as embeddings, retrieval, and LLM reasoning.</p>
<h2 id="typical-pdf-artifacts-include">Typical PDF artifacts include:</h2>
<ul>
<li>page header/footer</li>
<li>repeated table headers in multi-page tables</li>
<li>decorative elements interpreted as content</li>
</ul>
<p><em><strong>Artifacts should generally be ignored by assistive technologies and other content consumers, such as screen readers, text-to-speech systems, accessibility APIs, and AI semantic extraction pipelines.</strong></em></p>
<p>This concept is very similar to decorative elements in HTML accessibility.</p>
<p>For example, in HTML, decorative images use alt="", layout containers may use ARIA presentation roles, and CSS-generated visuals are ignored semantically. In PDFs, the equivalent mechanism is marking content as an <strong>Artifact</strong>.</p>
<p><strong>Artifacts also play a critical role in PDF/UA compliance and screen reader usability.</strong> Without proper artifact handling, assistive technologies may read decorative or repetitive content aloud, confusing users.</p>
<p>Modern accessibility validation tools such as <a href="https://pdf4wcag.com/"><strong>PDF4WCAG Accessibility Checker</strong></a> help identify these issues and ensure PDFs correctly distinguish meaningful content from decorative elements.</p>
<p><strong>The core requirement of both PDF/UA and WCAG</strong> is that every piece of content must be designated either as an artifact or as part of the structure tree; nothing can be left unmarked. This is exactly what PDF4WCAG verifies.</p>
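<p>The rule can be pictured with a toy model. The sketch below is not how PDF4WCAG or any PDF library represents content: <em>ContentItem</em> and its fields are hypothetical stand-ins, used only to show the idea that each piece of content must be either an artifact or attached to a structure element.</p>

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ContentItem:
    """Hypothetical stand-in for one piece of page content."""
    description: str
    is_artifact: bool = False
    structure_element: Optional[str] = None  # e.g. "P", "H1", "Figure"


def unmarked_items(items):
    """Return items that are neither artifacts nor part of the structure tree."""
    return [
        item for item in items
        if not item.is_artifact and item.structure_element is None
    ]


if __name__ == "__main__":
    page = [
        ContentItem("page footer", is_artifact=True),
        ContentItem("body paragraph", structure_element="P"),
        ContentItem("stray logo"),  # neither tagged nor marked as an artifact
    ]
    for item in unmarked_items(page):
        print("Unmarked content:", item.description)
```

<p>In this model only the stray logo is reported, because it is the one item that has been left out of both categories.</p>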
<p><img class=" wp-image-7317 aligncenter" src="https://duallab.com/wp-content/uploads/2026/05/artifacts--300x168.png" alt="" width="496" height="278" srcset="https://duallab.com/wp-content/uploads/2026/05/artifacts--300x168.png 300w, https://duallab.com/wp-content/uploads/2026/05/artifacts--768x430.png 768w, https://duallab.com/wp-content/uploads/2026/05/artifacts--1024x573.png 1024w, https://duallab.com/wp-content/uploads/2026/05/artifacts-.png 1600w" sizes="(max-width: 496px) 100vw, 496px" /></p>
<h2 id="sample-of-artifact-errors-after-pdf4wcag-validation">Sample of Artifact errors after PDF4WCAG validation</h2>
<p><img class=" wp-image-7318 aligncenter" src="https://duallab.com/wp-content/uploads/2026/05/errors-300x244.png" alt="" width="417" height="339" srcset="https://duallab.com/wp-content/uploads/2026/05/errors-300x244.png 300w, https://duallab.com/wp-content/uploads/2026/05/errors-768x625.png 768w, https://duallab.com/wp-content/uploads/2026/05/errors-1024x833.png 1024w, https://duallab.com/wp-content/uploads/2026/05/errors.png 1178w" sizes="(max-width: 417px) 100vw, 417px" /></p>
<p><img class=" wp-image-7319 aligncenter" src="https://duallab.com/wp-content/uploads/2026/05/overview-300x197.png" alt="" width="471" height="309" srcset="https://duallab.com/wp-content/uploads/2026/05/overview-300x197.png 300w, https://duallab.com/wp-content/uploads/2026/05/overview-768x503.png 768w, https://duallab.com/wp-content/uploads/2026/05/overview-1024x671.png 1024w" sizes="(max-width: 471px) 100vw, 471px" /></p>
<h2 id="pdf-20-and-richer-artifact-semantics">PDF 2.0 and richer artifact semantics</h2>
<p>PDF 2.0 (ISO 32000-2:2020) brought significant improvements to the handling and definition of artifacts compared to previous versions.</p>
<p>Key improvements to the Artifact model in PDF 2.0 include:</p>
<ul>
<li>Standardized Tagging: PDF 2.0 provides clearer, more robust mechanisms for marking items as artifacts, especially in tagged PDF, reducing ambiguity for accessibility tools.</li>
<li>Reduced Vague Wording: It addresses ambiguities in earlier PDF 1.7 specifications, providing clearer rules for how developers and software should handle artifacts.</li>
<li>Better Annotation Handling: Annotations and their relation to structural elements are better defined, reducing issues where background decorations or marginalia are misidentified as content.</li>
<li>Improved Structural Hierarchy: It clarifies how artifacted content can interact with the document structure tree, particularly regarding how tags should be ordered or ignored, which was a point of ambiguity in older standards.</li>
</ul>
<p><em><strong>To sum it up, proper use of artifacts is one of the foundational concepts of PDF accessibility.</strong></em><br />
<em><strong>A well-structured accessible PDF must clearly separate: meaningful semantic content and decorative or auxiliary presentation elements.</strong></em></p>
<p>As PDF accessibility evolves, especially with PDF 2.0 semantics and AI-driven document processing, artifact classification becomes increasingly important not only for accessibility specialists, but also for developers, publishers, and AI engineers building intelligent document systems.</p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/what-is-an-artifact-in-pdf/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>PDF Week London 2026!</title>
		<link>https://duallab.com/pdf-week-london-2026/</link>
		<comments>https://duallab.com/pdf-week-london-2026/#respond</comments>
		<pubDate>Wed, 13 May 2026 13:05:56 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Team]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7311</guid>
		<description><![CDATA[Boris Doubrov, CEO of Dual Lab, took part in PDF Week London 2026. PDF Week comes three times each year; twice for...]]></description>
				<content:encoded><![CDATA[<p><strong>Boris Doubrov, </strong>CEO of <strong>Dual Lab</strong>, took part in <a href="https://pdfa.org/event/pdf-week-spring-2026/">PDF Week London 2026</a>.</p>
<p><img class="alignnone size-medium wp-image-7312" src="https://duallab.com/wp-content/uploads/2026/05/1778622139638-300x169.jpeg" alt="" width="300" height="169" srcset="https://duallab.com/wp-content/uploads/2026/05/1778622139638-300x169.jpeg 300w, https://duallab.com/wp-content/uploads/2026/05/1778622139638-768x433.jpeg 768w, https://duallab.com/wp-content/uploads/2026/05/1778622139638.jpeg 800w" sizes="(max-width: 300px) 100vw, 300px" /></p>
<p><strong>PDF Week</strong> takes place three times each year: twice as &#8220;in-person&#8221; events and once online, as a series of meetings that actually takes about two weeks.</p>
<p>The in-person <strong>PDF Weeks</strong> offer attendees a variety of opportunities, including:</p>
<ul>
<li aria-level="1">Engaging directly with industry peers, international representatives and institutional end-users</li>
<li aria-level="1">In-person participation in the industry&#8217;s Technical and Liaison Working Group meetings</li>
<li aria-level="1">The opportunity to observe and contribute to ISO standardization processes</li>
</ul>
<p>The event is designed to foster connections between ongoing technical activities within the PDF Association and broader ISO standardization efforts.</p>
<p>Boris Doubrov chairs two of these communities: the <a href="https://pdfa.org/community/latex-project-lwg/">LaTeX Project LWG</a> and the <a href="https://pdfa.org/community/pdf-a-technical-working-group/">PDF/A TWG</a>.</p>
<p>Here is a <a href="https://pdfa.org/cheers-from-london/">short summary of the week</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/pdf-week-london-2026/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VeraPDF and Arlington 1.30 full release out now</title>
		<link>https://duallab.com/verapdf-and-arlington-1-30-full-release-out-now/</link>
		<comments>https://duallab.com/verapdf-and-arlington-1-30-full-release-out-now/#respond</comments>
		<pubDate>Fri, 24 Apr 2026 07:48:40 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[Products]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[veraPDF]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7306</guid>
		<description><![CDATA[The v1.30 release of the veraPDF and Arlington tools is out now! The new version addresses technical debt and various real...]]></description>
				<content:encoded><![CDATA[<blockquote><p><strong>The v1.30 release of the veraPDF and Arlington tools is out now!</strong></p>
<p><img class="alignnone size-medium wp-image-7243" src="https://duallab.com/wp-content/uploads/2026/02/veraPDF-Web-Demonstrator-e1739281803176-317x160-300x151.jpg" alt="" width="300" height="151" srcset="https://duallab.com/wp-content/uploads/2026/02/veraPDF-Web-Demonstrator-e1739281803176-317x160-300x151.jpg 300w, https://duallab.com/wp-content/uploads/2026/02/veraPDF-Web-Demonstrator-e1739281803176-317x160.jpg 317w" sizes="(max-width: 300px) 100vw, 300px" />  <img class="alignnone wp-image-7244" src="https://duallab.com/wp-content/uploads/2026/02/arlington-e1746619831570.png" alt="" width="147" height="152" /></p>
<p><em>The new version addresses technical debt and various real-world issues reported by the community. <a href="https://verapdf.org/">veraPDF</a> is an open-source, industry-supported <strong>PDF/A</strong> and <strong>PDF/UA</strong> (Machine checks) validator and part of the OPF reference toolset. The Arlington PDF Checker is an open-source PDF checker based on the Arlington PDF Model.</em></p></blockquote>
<h3>Validation</h3>
<ul>
<li aria-level="1">(PDF/A-4, PDF/UA-2) added validation of pdfaid:rev and pdfuaid:rev</li>
<li aria-level="1">(PDF/UA-2, WTPDF-1) permitted <b>math</b> that is not an immediate child of the <b>Formula</b> structure element</li>
<li aria-level="1">(PDF/UA-2, WTPDF-1) limited PDF/UA-2 clauses 8.10.1 and 8.10.2 to <b>Widget</b> annotations used as the appearance of interactive form fields.</li>
<li aria-level="1">(PDF/A-1) fixed matching date validation in case any dates are missing in the XMP metadata.</li>
<li aria-level="1">(PDF/UA-1,2, WTPDF-1) permitted top-left cells with no Scope in the Table structure element.</li>
</ul>
<h3>PDF Parser</h3>
<ul>
<li aria-level="1">Fixed runtime exceptions in cases of invalid data structures</li>
</ul>
<h3>Core library</h3>
<ul>
<li aria-level="1">End of PDFBox support</li>
<li aria-level="1">Added support for Java 25.</li>
<li aria-level="1">Added automatic module names to the MANIFEST files.</li>
<li aria-level="1">Fixed thread safety issues.</li>
</ul>
<h3>veraPDF Installer</h3>
<ul>
<li aria-level="1">Added a new option to install the CLI only (no GUI included).</li>
<li aria-level="1">Removed sources of validation profiles and the model from the installer.</li>
</ul>
<h3>Arlington features</h3>
<ul>
<li aria-level="1">Implements the Arlington model as of the release date</li>
<li aria-level="1">Upgraded veraPDF parser to version 1.30.1 RC</li>
<li aria-level="1">Fixed retrieval of PDF version via the Version entry in the Catalog</li>
<li aria-level="1">Added auto-detect for ISO_19005_3 and WTPDF extensions</li>
<li aria-level="1">Fixed issues in processing merged annotations and fields</li>
</ul>
<h3>Security</h3>
<ul>
<li aria-level="1">Included the fix for GHSA-72hv-8253-57q</li>
</ul>
<p>Read the <a href="https://github.com/veraPDF/veraPDF-library/releases/tag/v1.30.1">release notes</a> to learn more about the new fixes and features.</p>
<h3>Download</h3>
<p><strong>veraPDF v1.30</strong> can be downloaded here: <a href="https://software.verapdf.org/releases/1.30/verapdf-greenfield-1.30.1-installer.zip">veraPDF 1.30</a>. Release builds are available in the <a href="https://software.verapdf.org/releases/1.30">release folder</a>. Developers can use the <strong>veraPDF</strong> software libraries in their Java projects <a href="https://central.sonatype.com/search?q=org.verapdf&amp;sort=published">via Maven Central</a>, an established distribution platform for Java software.</p>
<p>Docker images for <strong>veraPDF</strong> are also available. The containerised version of the command-line application is named <a href="https://hub.docker.com/r/verapdf/cli">verapdf/cli</a>, and the REST application is called <a href="https://hub.docker.com/r/verapdf/rest">verapdf/rest</a>.</p>
<p>Arlington v1.30 can be downloaded here: <a href="https://software.verapdf.org/releases/arlington/1.30/verapdf-arlington-1.30.1-installer.zip">Arlington 1.30</a>. Again, older builds can be found in the <a href="https://software.verapdf.org/releases/arlington/">release folder</a> and on <a href="https://central.sonatype.com/search?q=org.verapdf&amp;sort=published">Maven Central</a>.</p>
<p>Web demonstrators are available at <a href="https://demo.verapdf.org/">demo.verapdf.org</a> and <a href="https://arlington.verapdf.org/">arlington.verapdf.org</a>, respectively.</p>
<p>We invite the community to download and test the release. Please report bugs on the <a href="https://github.com/veraPDF/veraPDF-library/issues">veraPDF issue tracker on GitHub</a>.</p>
<h3><strong>About veraPDF</strong></h3>
<p><em><strong>veraPDF</strong> is an open-source PDF/A validator covering all parts of the PDF/A and PDF/UA (<a href="https://pdfa.org/matterhorn-protocol/">Matterhorn Protocol</a> Machine failure conditions) standards. Originally funded by the PREFORMA project, veraPDF has been sustained and maintained by the <a href="https://openpreservation.org/">Open Preservation Foundation</a> since 2017. <a href="https://duallab.com/">Dual Lab</a> provides active user support and carries out maintenance and bug fixes. The <a href="https://pdfa.org/">PDF Association</a>’s <a href="https://www.pdfa.org/community/pdf-a-technical-working-group/">PDF/A Technical Working Group</a> continues its role in resolving ambiguities arising from veraPDF’s usage in the field.</em></p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/verapdf-and-arlington-1-30-full-release-out-now/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Release: PDF4WCAG 1.8 Accessibility Checker</title>
		<link>https://duallab.com/new-release-pdf4wcag-1-8-accessibility-checker/</link>
		<comments>https://duallab.com/new-release-pdf4wcag-1-8-accessibility-checker/#respond</comments>
		<pubDate>Fri, 17 Apr 2026 10:18:10 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[PDF4WCAG]]></category>
		<category><![CDATA[Products]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7294</guid>
		<description><![CDATA[The Dual Lab team announces update 1.8 of PDF4WCAG, delivering further improvements in validation accuracy, user experience,...]]></description>
				<content:encoded><![CDATA[<p><span style="font-weight: 400;">The </span><b>Dual Lab</b><span style="font-weight: 400;"> team announces update </span><b>1.8</b><span style="font-weight: 400;"> of </span><a href="https://pdf4wcag.com/"><b>PDF4WCAG</b></a><span style="font-weight: 400;">, delivering further improvements in validation accuracy, user experience, and overall stability.</span></p>
<h3><span style="font-weight: 400;">Improved Accuracy</span></h3>
<p><strong>Fixes in PDF/UA validation </strong><span style="font-weight: 400;">to align with the latest technical discussions within the TWGs of the <strong>PDF Association</strong> and with <a href="https://verapdf.org/">veraPDF</a> improvements:</span></p>
<ul>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">permitted Math that is not an immediate child of the Formula structure element;</span></li>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">improved glyph name calculation for </span><b>Type1</b><span style="font-weight: 400;"> and </span><b>TrueType</b><span style="font-weight: 400;"> fonts;</span></li>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">adjusted validation of the PDF Table structure element.</span></li>
</ul>
<p><strong>Missing translations of error messages</strong><span style="font-weight: 400;"> have also been added to improve clarity across languages (Dutch, German, English).</span></p>
<h3><span style="font-weight: 400;">Enhanced User Experience</span></h3>
<p><strong>Error preview filters</strong><span style="font-weight: 400;"> have been reworked for more convenient error inspection. </span></p>
<p><img class="alignnone size-medium wp-image-7295" src="https://duallab.com/wp-content/uploads/2026/04/preview-265x300.png" alt="" width="265" height="300" srcset="https://duallab.com/wp-content/uploads/2026/04/preview-265x300.png 265w, https://duallab.com/wp-content/uploads/2026/04/preview.png 518w" sizes="(max-width: 265px) 100vw, 265px" /></p>
<p><strong>Export Validation Results:</strong><span style="font-weight: 400;"> users can export validation results as a PDF for client reporting, documentation, or internal audit purposes. Just click </span><b>Export results</b><span style="font-weight: 400;"> on the Summary page.</span></p>
<p><img class="alignnone size-medium wp-image-7296" src="https://duallab.com/wp-content/uploads/2026/04/export_results-300x159.png" alt="" width="300" height="159" srcset="https://duallab.com/wp-content/uploads/2026/04/export_results-300x159.png 300w, https://duallab.com/wp-content/uploads/2026/04/export_results-768x408.png 768w, https://duallab.com/wp-content/uploads/2026/04/export_results-1024x543.png 1024w, https://duallab.com/wp-content/uploads/2026/04/export_results.png 1417w" sizes="(max-width: 300px) 100vw, 300px" /></p>
<p><img class="alignnone size-medium wp-image-7297" src="https://duallab.com/wp-content/uploads/2026/04/report-219x300.png" alt="" width="219" height="300" srcset="https://duallab.com/wp-content/uploads/2026/04/report-219x300.png 219w, https://duallab.com/wp-content/uploads/2026/04/report-768x1053.png 768w, https://duallab.com/wp-content/uploads/2026/04/report-747x1024.png 747w, https://duallab.com/wp-content/uploads/2026/04/report.png 926w" sizes="(max-width: 219px) 100vw, 219px" /></p>
<p><strong>One-Click Refresh:</strong><span style="font-weight: 400;"> users can re-upload a document and repeat its analysis in one click (Web) or via the Refresh button in the Desktop version.</span></p>
<p><strong>GitHub and collaboration:</strong><span style="font-weight: 400;"><strong> PDF4WCAG</strong> now includes a direct link to its <a href="https://github.com/duallab/PDF4WCAG-public/issues">GitHub repository</a> within the feedback popup, inviting developers and users to contribute to the tool&#8217;s roadmap.</span></p>
<p><strong>Command line:</strong><span style="font-weight: 400;"> the <strong>PDF4WCAG</strong> command line can be used in the console (paid subscription).</span></p>
<p><strong>Commercial use of </strong><b>PDF4WCAG</b><span style="font-weight: 400;">: <a href="http://www.pdf4wcag.com/licensing/">commercial use</a> of the Desktop version and CLI automation is available with an annual subscription for 299 EUR / 359 USD (excl. taxes).</span></p>
<blockquote><p><span style="font-weight: 400;">Release 1.8 reflects our ongoing commitment to providing precise, standards-aligned accessibility validation and a smoother user experience for organizations working toward </span><b>WCAG</b><span style="font-weight: 400;"> and </span><b>PDF/UA</b><span style="font-weight: 400;"> compliance.</span></p></blockquote>
<h3><span style="font-weight: 400;">Roadmap Update</span></h3>
<p><span style="font-weight: 400;">We’re excited to announce the start of beta testing for the <strong>PDF4WCAG Integration API</strong>. If you’re interested in participating as a beta tester, please send a request to </span><a href="mailto:info@pdf4wcag.com"><span style="font-weight: 400;">info@pdf4wcag.com</span></a><span style="font-weight: 400;">.</span></p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/new-release-pdf4wcag-1-8-accessibility-checker/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dual Lab launches reports on PDF Accessibility Trends</title>
		<link>https://duallab.com/dual-lab-launches-quarterly-reports-on-pdf-accessibility-trends-based-on-common-crawl-data/</link>
		<comments>https://duallab.com/dual-lab-launches-quarterly-reports-on-pdf-accessibility-trends-based-on-common-crawl-data/#respond</comments>
		<pubDate>Thu, 26 Mar 2026 07:40:40 +0000</pubDate>
		<dc:creator><![CDATA[Julia Katash]]></dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">https://duallab.com/?p=7285</guid>
		<description><![CDATA[Dual Lab Launches Quarterly Reports on PDF Accessibility Trends based on Common Crawl data Dual Lab announces the upcoming publication of...]]></description>
				<content:encoded><![CDATA[<blockquote><p><strong>Dual Lab Launches Quarterly Reports on PDF Accessibility Trends based on Common Crawl data</strong></p></blockquote>
<p><b>Dual Lab</b><span style="font-weight: 400;"> announces the upcoming publication of a new analytical report on PDF Accessibility Trends based on the </span><a href="https://commoncrawl.org/"><b>Common Crawl</b></a><span style="font-weight: 400;"> dataset. These in-depth analytical reports will be released quarterly and will provide data-driven insights into global </span><b>PDF trends</b><span style="font-weight: 400;">. The first report analyzes </span><b>15 million PDF documents</b><span style="font-weight: 400;"> from the </span><b>CC-MAIN-2026-04</b> <b>Common Crawl</b><span style="font-weight: 400;"> archive.</span></p>
<h3><b>Mild growth in the share of Tagged PDFs</b></h3>
<p><span style="font-weight: 400;">As a preview we present a sample report showing the share of Tagged PDFs among all PDFs in the </span><b>Common Crawl</b><span style="font-weight: 400;"> dataset, grouped by the document creation month.</span></p>
<p><span style="font-weight: 400;">Our analysis shows a mild increase in the proportion of tagged PDFs over the past three years. The share has been growing by approximately </span><b>1.5 percentage points per year</b><span style="font-weight: 400;">, surpassing the significant milestone of </span><b>50% in mid-2025</b><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">This means that today, more than half of newly created PDF documents appearing in the Common Crawl archives include a structure tree with semantic information.</span></p>
<h3><b>Why Tagged PDFs Matter</b></h3>
<p><span style="font-weight: 400;">Tagged PDFs contain a structure tree that defines headings, paragraphs, tables, figures, and other semantic elements. This structure is essential for:</span></p>
<ul>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">The ability of screen readers to interpret the document</span></li>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">Logical reading order</span></li>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">Compliance with accessibility standards such as </span><b>PDF/UA</b></li>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">Alignment with </span><b>WCAG</b><span style="font-weight: 400;"> requirements</span></li>
</ul>
<p><span style="font-weight: 400;">The growth in tagged documents indicates a positive global shift toward better structured and potentially more accessible PDF publishing.</span></p>
<h3><b>Trend in the Share of Tagged PDFs Among All PDFs</b></h3>
<p><img class="alignnone size-medium wp-image-7286" src="https://duallab.com/wp-content/uploads/2026/03/pdf-300x145.png" alt="" width="300" height="145" srcset="https://duallab.com/wp-content/uploads/2026/03/pdf-300x145.png 300w, https://duallab.com/wp-content/uploads/2026/03/pdf.png 736w" sizes="(max-width: 300px) 100vw, 300px" /></p>
<p><b>Dual Lab</b><span style="font-weight: 400;"> analyzed 15 million PDF documents from the </span><b>Common Crawl</b><span style="font-weight: 400;"> dataset CC-MAIN-2026-04 to examine how the share of tagged PDFs has changed over time.</span></p>
<p><span style="font-weight: 400;">The results show a clear rising trend over the past three years. The proportion of tagged PDFs (documents containing a structural tag tree) has increased steadily by approximately </span><b>1.5 percentage points per year</b><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">A key milestone was reached in </span><b>mid-2025 (July)</b><span style="font-weight: 400;">, when the share exceeded </span><b>50%</b><span style="font-weight: 400;"> for the first time. This indicates that more than half of newly created PDF documents indexed </span><b>in Common Crawl </b><span style="font-weight: 400;">now include structural tagging.</span></p>
<p><span style="font-weight: 400;">The growth reflects broader adoption of structured document generation tools and increasing awareness of accessibility and machine-readability requirements. While the trend is positive, continued monitoring is essential to evaluate not only the presence of tags but also their structural quality.</span></p>
<h3><b>Reports by Dual Lab</b></h3>
<p><b>Dual Lab</b><span style="font-weight: 400;"> aims to provide objective data that supports users, accessibility experts, and organizations working toward more inclusive digital content.</span></p>
<p><b><i>The first full report will be published soon. </i></b></p>
<p><b>Reports will be available on: </b><span style="font-weight: 400;">the <strong>Dual Lab</strong> website; the </span><strong>PDF4WCAG</strong><span style="font-weight: 400;"> website (the PDF accessibility validation tool developed by Dual Lab); the </span><a href="https://groups.google.com/g/duallab"><span style="font-weight: 400;">Google Group </span><span style="font-weight: 400;">Dual Lab Reports on PDF Accessibility Trends</span></a><span style="font-weight: 400;">; our <a href="https://discord.gg/ny6bG5Yfe">Discord</a> server; and our channels on </span><a href="https://x.com/PDF4WCAG"><span style="font-weight: 400;">X</span></a><span style="font-weight: 400;"> and </span><a href="https://www.linkedin.com/company/3658503/admin/page-posts/published/"><span style="font-weight: 400;">LinkedIn</span></a><span style="font-weight: 400;">.</span></p>
]]></content:encoded>
			<wfw:commentRss>https://duallab.com/dual-lab-launches-quarterly-reports-on-pdf-accessibility-trends-based-on-common-crawl-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
