What makes a PDF accessible?

An accessible PDF has a logical tag structure (headings, paragraphs, lists, tables), alt text for images, a defined reading order, a document title, a specified language, sufficient color contrast, and bookmarks for longer documents. It must be real text (not scanned images of text) and forms must have labeled fields.

Can scanned PDFs be made accessible?

Scanned PDFs are essentially images of pages with no underlying text. They must first be processed with Optical Character Recognition (OCR) to convert the image into selectable text, then tagged and structured for accessibility. OCR accuracy varies and the result must be proofread.

Is it better to use HTML instead of PDF for web content?

In most cases, yes. HTML is natively more accessible, responsive, and easier to maintain. PDFs should be reserved for content that requires fixed formatting (legal documents, printable forms, official publications). When a PDF is necessary, it must be made accessible.

TechnicalIntermediate

Accessible PDF

A PDF document that is structured and tagged so that its content can be read and navigated by assistive technologies such as screen readers.

In simple terms: A regular PDF is like a picture of a page — a computer can show it, but it cannot really understand what is on it. An accessible PDF has hidden labels and tags that tell the computer what each part is — this is a heading, this is a picture of a cat, this is a list. That way, special programs can read it out loud correctly.

What Is Accessible PDF?

An accessible PDF is a Portable Document Format file that has been structured and tagged so that its content can be properly interpreted and navigated by assistive technologies, particularly screen readers. Unlike a standard PDF, which may appear visually correct but contain no structural information, an accessible PDF includes a tag tree that defines the logical structure of the document — headings, paragraphs, lists, tables, images with alt text, and a defined reading order. The PDF format was originally designed for print-fidelity document exchange, prioritizing visual consistency across platforms. This print-oriented design means that without deliberate effort, PDFs are often among the least accessible document formats on the web. A screen reader encountering an untagged PDF may read text in the wrong order, miss content entirely, or present a table as an incomprehensible stream of text. The PDF/UA (Universal Accessibility) standard, published as ISO 14289, defines the technical requirements for accessible PDF documents. It builds on the existing PDF tag architecture to establish clear rules for how content must be structured, tagged, and presented. WCAG also includes PDF-specific techniques that map its success criteria to PDF authoring practices. Accessible PDFs are particularly important in government, education, healthcare, and legal contexts where PDF is the standard format for official documents. Section 508 requires all federal electronic documents, including PDFs, to be accessible. The ADA extends this requirement to state and local governments (Title II) and businesses open to the public (Title III).

Why It Matters

PDFs are ubiquitous. Tax forms, legal contracts, academic papers, product manuals, financial statements, and government publications are routinely distributed as PDFs. When these documents are inaccessible, entire categories of essential information are locked away from people with disabilities. The problem is widespread. Studies have consistently found that the majority of PDFs on the web are inaccessible. A 2019 analysis by the accessibility firm Level Access found that over 90% of government PDFs tested had accessibility issues. The situation has improved but remains a significant gap. For organizations, inaccessible PDFs represent legal risk. Federal agencies face Section 508 compliance requirements. State and local governments face ADA Title II obligations reinforced by the 2024 web accessibility rule. Private businesses face Title III litigation — and PDFs are increasingly cited in accessibility lawsuits and demand letters. Beyond compliance, accessible PDFs serve practical purposes. They are searchable, indexable by search engines, compatible with text-to-speech on mobile devices, and usable on refreshable braille displays. An accessible PDF is a better document for everyone.

How It Works

Creating an accessible PDF involves several layers of structure and metadata. ### Tag Structure The foundation of PDF accessibility is the tag tree — a hierarchical structure that defines what each piece of content is. Tags include `<H1>` through `<H6>` for headings, `<P>` for paragraphs, `<L>` and `<LI>` for lists, `<Table>`, `<TR>`, `<TH>`, and `<TD>` for tables, `<Figure>` for images, and `<Link>` for hyperlinks. The tag tree is conceptually similar to HTML's DOM structure and serves the same purpose: conveying meaning independent of visual presentation. Tags must be applied in a logical reading order that matches the intended content sequence. In multi-column layouts, for instance, the tag order must ensure that a screen reader reads the content column by column rather than jumping across the page line by line. ### Alternative Text Images in accessible PDFs must have alt text that describes their content or purpose. Decorative images should be marked as artifacts so screen readers skip them. Complex images like charts or diagrams may need both alt text and a longer description provided in the surrounding text. ### Document Metadata Accessible PDFs must include metadata: a document title (displayed in the title bar instead of the filename), a specified language (so screen readers use the correct pronunciation), and bookmarks for documents longer than a few pages that allow users to jump directly to sections. ### Table Structure Tables must use proper header cell markup (`<TH>`) with scope attributes indicating whether they are row or column headers. Complex tables with merged cells or multiple header levels require explicit ID-and-headers associations so screen readers can correctly associate data cells with their headings. ### Form Fields Interactive PDF forms must have labeled form fields with accessible names, descriptions for complex fields, a logical tab order, and clear instructions. Error messages must be programmatically associated with their fields. ### Authoring Workflow The most efficient way to create accessible PDFs is to start with an accessible source document. When using Microsoft Word, applying heading styles, adding alt text to images, using built-in list and table tools, and then exporting to PDF with the "tagged PDF" option produces a reasonably accessible result. Adobe InDesign similarly supports accessibility tagging. Documents created in these tools still typically require review and remediation in Adobe Acrobat Pro, which provides an accessibility checker and tag-editing tools. Retrofitting accessibility into existing untagged PDFs (remediation) is more labor-intensive and typically involves running Acrobat's "Make Accessible" wizard, manually correcting the tag tree, adding alt text, setting reading order, and validating the result.

Examples

**Inaccessible PDF:** A scanned paper form is posted as a PDF on a government website. It is a flat image with no text layer, no tags, and no form fields. Screen readers cannot read any content, and the form cannot be filled out digitally. **Accessible PDF:** The same form is recreated as a tagged PDF with a text layer, heading structure, labeled form fields, and tab order. A screen reader can read the form instructions and navigate between fields. **Inaccessible PDF:** A financial report has a complex table showing quarterly revenue by product line. The table has no header tags, so a screen reader reads every cell in sequence with no indication of which column or row it belongs to. **Accessible PDF:** The same table uses `<TH>` tags with proper scope for row and column headers. A screen reader announces "Q3 2025, Widget Product Line, $4.2 million" as the user navigates cells. **Inaccessible PDF:** A marketing brochure with a two-column layout has the reading order set left-to-right across both columns rather than top-to-bottom within each column. The screen reader reads the first line of column one followed by the first line of column two, producing garbled output. **Accessible PDF:** The tag tree defines the reading order as column one top-to-bottom, then column two top-to-bottom, matching the visual reading flow.

Key Takeaways

**Prefer HTML over PDF when possible.** HTML is inherently more accessible, responsive, and easier to update. Reserve PDF for content that requires fixed formatting.
**Start accessible, not remediate later.** Building accessibility into the source document (Word, InDesign) is far more efficient than remediating an untagged PDF after the fact.
**Tag structure is essential.** Without a proper tag tree, a PDF is essentially a flat image to assistive technology. Tags define headings, lists, tables, reading order, and content relationships.
**Test with assistive technology.** Acrobat's built-in accessibility checker catches structural issues, but testing with an actual screen reader (NVDA or JAWS) reveals how the document actually reads.
**Scanned PDFs require OCR first.** A scanned document must be converted from image to text via OCR before any accessibility tagging can be applied.
**Plan for ongoing maintenance.** Accessible PDFs must be maintained as content changes. Updated documents need re-verification, and new versions should follow the same accessibility standards as the original.

Frequently Asked Questions

What makes a PDF accessible?: An accessible PDF has a logical tag structure (headings, paragraphs, lists, tables), alt text for images, a defined reading order, a document title, a specified language, sufficient color contrast, and bookmarks for longer documents. It must be real text (not scanned images of text) and forms must have labeled fields.
Can scanned PDFs be made accessible?: Scanned PDFs are essentially images of pages with no underlying text. They must first be processed with Optical Character Recognition (OCR) to convert the image into selectable text, then tagged and structured for accessibility. OCR accuracy varies and the result must be proofread.
Is it better to use HTML instead of PDF for web content?: In most cases, yes. HTML is natively more accessible, responsive, and easier to maintain. PDFs should be reserved for content that requires fixed formatting (legal documents, printable forms, official publications). When a PDF is necessary, it must be made accessible.

Need help making your website ADA compliant?

Our team specializes in ADA-compliant web design and remediation. Get a free accessibility audit today.

Last updated: 2026-03-15