Question 1

How does the HTML to Markdown converter work?

Accepted Answer

SmartMarkdown uses the browser's built-in DOMParser API to parse your HTML input into a full DOM tree. It then walks the element tree recursively, mapping each semantic HTML element to its Markdown equivalent — headings to hash syntax, lists to dashes, tables to pipe tables, links to bracket syntax, and code elements to backticks. All processing runs in your browser with no server round-trip.
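
The recursive walk described above can be sketched in a few lines. SmartMarkdown's own source is not shown here, so this is only an illustration: it uses Python's standard `html.parser` as a stand-in for the browser's DOMParser, and the names `Node`, `TreeBuilder`, `render`, and `html_to_markdown` are hypothetical. Ordered-list numbering, tables, and images are left out to keep it short.

```python
import re
from html.parser import HTMLParser

VOID_TAGS = {"br", "hr", "img", "meta", "link", "input"}


class Node:
    """Minimal element node: a tag name, attributes, and child nodes or strings."""

    def __init__(self, tag, attrs=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.children = []


class TreeBuilder(HTMLParser):
    """Builds a small tree from parser events (stand-in for a browser DOM)."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Pop back to the nearest matching open element, if there is one.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        self.stack[-1].children.append(data)


def render(node):
    """Recursively map a node and its children to Markdown text."""
    if isinstance(node, str):
        return re.sub(r"\s+", " ", node)
    inner = "".join(render(child) for child in node.children)
    tag = node.tag
    if re.fullmatch(r"h[1-6]", tag):
        return "#" * int(tag[1]) + " " + inner.strip() + "\n\n"
    if tag == "p":
        return inner.strip() + "\n\n"
    if tag in ("strong", "b"):
        return f"**{inner}**"
    if tag in ("em", "i"):
        return f"*{inner}*"
    if tag == "code":
        return f"`{inner}`"
    if tag == "a":
        return f"[{inner}]({node.attrs.get('href') or ''})"
    if tag == "li":
        return f"- {inner.strip()}\n"
    if tag in ("ul", "ol"):
        return inner + "\n"
    return inner  # div, span, section, root: transparent containers


def html_to_markdown(html):
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()  # flush any buffered trailing text
    return re.sub(r"\n{3,}", "\n\n", render(builder.root)).strip()
```

Calling `html_to_markdown('<h2>Title</h2><p>Some <b>bold</b> text.</p>')` returns a `## Title` heading, a blank line, and then `Some **bold** text.`. Each element type gets one small rule, and unknown elements fall through to the transparent-container case.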

Question 2

Do I paste the HTML source code or the rendered page?

Accepted Answer

Paste the raw HTML source code. To get the HTML source of a webpage, right-click and select 'View Page Source', or use your browser's developer tools to copy the outer HTML of a specific element. Pasting rendered text (copying what you see on the page) loses all structure — the converter needs the HTML markup to detect headings, lists, and other elements.

Question 3

Which HTML elements are converted to Markdown?

Accepted Answer

SmartMarkdown converts: h1–h6 (heading syntax), p (paragraphs), ul/ol/li (lists), table/tr/th/td (GFM pipe tables), a (links), strong/b (bold), em/i (italic), code (inline code), pre/code (fenced code blocks), blockquote (blockquotes), img (image syntax), hr (horizontal rules), and br (line breaks). Non-semantic elements like div, span, and section are treated as transparent containers and their text content is preserved.

Question 4

How are HTML tables converted to Markdown?

Accepted Answer

SmartMarkdown walks the table's thead, tbody, and tr/th/td structure to extract the header row and data rows. Each row is output as a GFM pipe table row with cells separated by | characters. A separator row of dashes is inserted after the header row. Tables with colspan or rowspan attributes are linearized — the merged content is placed in the first cell of the merged range.
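
As a rough illustration of the table steps above, the following Python sketch collects rows with the standard `html.parser` module and prints them as a GFM pipe table. It is not SmartMarkdown's code. `TableCollector` and `table_to_markdown` are hypothetical names, and several choices are assumptions made for this sketch: the first row is treated as the header, cells covered by a `colspan` are padded with empty cells, pipe characters are escaped, and `rowspan` is ignored.

```python
from html.parser import HTMLParser


class TableCollector(HTMLParser):
    """Collects table rows as lists of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._cell = None  # text pieces of the cell being read, or None
        self._span = 1

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("th", "td") and self.rows:
            self._cell = []
            span = dict(attrs).get("colspan") or "1"
            self._span = int(span) if span.isdigit() else 1

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell is not None:
            text = " ".join("".join(self._cell).split()).replace("|", "\\|")
            # Assumption: merged content goes in the first cell and the
            # rest of the merged range is filled with empty cells.
            self.rows[-1].extend([text] + [""] * (self._span - 1))
            self._cell = None


def table_to_markdown(html):
    collector = TableCollector()
    collector.feed(html)
    collector.close()
    rows = [row for row in collector.rows if row]
    if not rows:
        return ""
    width = max(len(row) for row in rows)
    rows = [row + [""] * (width - len(row)) for row in rows]
    lines = ["| " + " | ".join(row) + " |" for row in rows]
    # Separator row of dashes goes directly after the header row.
    lines.insert(1, "| " + " | ".join(["---"] * width) + " |")
    return "\n".join(lines)
```

Padding every row to the same width keeps the output a valid pipe table even when the source rows have uneven cell counts.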

Question 5

Can I convert a full webpage's HTML?

Accepted Answer

Yes, but consider pasting only the main content area's HTML rather than the full page source. A full page source includes navigation, footer, sidebar, and script elements that will produce noisy output. Use browser developer tools to select and copy the outer HTML of the main content element (for example, the primary content div) for a cleaner conversion.

Question 6

Are CSS styles and inline styles preserved?

Accepted Answer

No. CSS classes, inline style attributes, and stylesheet rules have no Markdown equivalent. SmartMarkdown relies entirely on the semantic HTML element types and their structural relationships to determine Markdown output. Visual styling information is discarded.

Question 7

How are script and style tags handled?

Accepted Answer

Script (<script>) and style (<style>) elements and their content are automatically excluded from the Markdown output. SmartMarkdown's DOM walker skips these non-content elements to prevent JavaScript code and CSS from appearing in the converted text.
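
One way to picture the skipping behavior is a depth counter that suppresses text while the parser is inside a script or style element. The Python sketch below shows only that idea, using the standard `html.parser` module in place of the browser DOM. `TextExtractor` and `visible_text` are hypothetical names, and the sketch returns plain text instead of Markdown.

```python
from html.parser import HTMLParser

SKIPPED_TAGS = {"script", "style"}


class TextExtractor(HTMLParser):
    """Collects text, ignoring anything inside <script> or <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if not self._skip_depth:
            self.parts.append(data)


def visible_text(html):
    extractor = TextExtractor()
    extractor.feed(html)
    extractor.close()
    return " ".join(" ".join(extractor.parts).split())
```

In a tree-based walker the same effect comes from returning an empty string for these two tags before descending into their children.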

Question 8

Can I use this to migrate content from a CMS?

Accepted Answer

Yes — CMS content migration is one of the primary use cases. Export or copy the HTML from your CMS editor or page source, paste it into SmartMarkdown, and get clean Markdown output. This works well for WordPress posts, Drupal pages, and any CMS that stores content as HTML. The output can then be imported into a Markdown-native CMS or static site generator.
