Question 1

How does the EPUB to Markdown converter work?

Accepted Answer

SmartMarkdown opens the EPUB ZIP archive in memory, reads the OPF package manifest to determine chapter order, then extracts each chapter's HTML content file. The HTML is parsed using DOM traversal, mapping semantic HTML elements (h1-h6, p, ul, ol, table, blockquote, em, strong) to their Markdown equivalents. All processing runs in your browser with no server upload.

Question 2

What versions of EPUB are supported?

Accepted Answer

SmartMarkdown supports EPUB 2 (OPS 2.0/OPF 2.0, common before 2011) and EPUB 3 (the current standard, using XHTML5 for content documents). Both formats use the same ZIP-based container structure. EPUB 3's additional features such as media overlays and scripted content are ignored — only the text and structure are extracted.

Question 3

Does the converter work with DRM-protected EPUB files?

Accepted Answer

No. DRM (Digital Rights Management) protected EPUB files encrypt their content so it cannot be read without the authorized DRM client. SmartMarkdown can only process DRM-free EPUB files. Most EPUB files from Project Gutenberg, Standard Ebooks, personal exports, and many independent publishers are DRM-free.

Question 4

How are multiple chapters handled in the output?

Accepted Answer

Each chapter in the EPUB (as defined by the OPF spine order) becomes a section in the Markdown output. The chapter's first H1 or H2 heading (from the HTML content) is used as the section heading. Chapters are output in the reading order defined by the EPUB's spine, with a horizontal rule separator between each chapter.

Question 5

Is book metadata (title, author, ISBN) included?

Accepted Answer

Yes. SmartMarkdown extracts metadata from the EPUB's OPF package file — including the book title, author names, publisher, language, publication date, and ISBN/identifier — and includes it as a YAML front matter block at the top of the converted Markdown document. This is useful for documentation workflows that use front matter for metadata.

Question 6

How are embedded images in EPUB handled?

Accepted Answer

Images embedded in EPUB chapter HTML files are noted as Markdown image references with the original src path preserved as the alt text. The actual image binary data is not extracted or embedded in the Markdown output. If you need images, extract the EPUB archive separately to access the image files.

Question 7

Can I convert technical documentation packaged as EPUB?

Accepted Answer

Yes — technical documentation in EPUB format (such as O'Reilly books, language specifications, or standards documents distributed as EPUB) converts well because they typically use clean, semantic HTML structure for their content. Chapters map to sections, code examples to fenced code blocks, and tables to GFM pipe tables.

Question 8

Are footnotes and endnotes preserved in the output?

Accepted Answer

Footnotes in EPUB files are typically implemented as hyperlinked HTML anchors, often in separate content documents. SmartMarkdown extracts the footnote text and references them as Markdown footnote syntax where the structure is clear. Complex footnote implementations may require manual cleanup in the editor after conversion.

EPUB to Markdown Converter

What Is an EPUB to Markdown Converter

The EPUB File Format Explained

How EPUB to Markdown Conversion Works

Benefits of Converting EPUB to Markdown

Common Use Cases

Tips for Better Conversion Results

Frequently Asked Questions