Question 1

What is the structure of the generated JSON?

Accepted Answer

The top-level JSON object has two properties: metadata (wordCount, sectionCount, generatedAt timestamp, and sourceLength) and document (title, taken from the first H1, and a sections array). Each section object has: id (a slug derived from the heading text), heading (the heading text), level (the heading level, 1-6), content (the section's paragraph text, concatenated), codeBlocks (an array of {language, code} objects), tables (an array of {headers, rows} objects), and subsections (a nested sections array for child headings).
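To make the shape concrete, here is a hand-written illustration of what such an output could look like for a small document, expressed as a JavaScript object. All values are made up, and the isSection helper is a hypothetical check added for this example, not part of SmartMarkdown.

```javascript
// Hand-written illustration of the schema described above (not real tool output).
const exampleOutput = {
  metadata: {
    wordCount: 11,
    sectionCount: 2,
    generatedAt: "2024-01-01T00:00:00.000Z",
    sourceLength: 240, // illustrative value; see the metadata question for this field
  },
  document: {
    title: "Getting Started",
    sections: [
      {
        id: "installation",
        heading: "Installation",
        level: 2,
        content: "Install the package with npm.",
        codeBlocks: [{ language: "bash", code: "npm install example-package" }],
        tables: [],
        subsections: [
          {
            id: "requirements",
            heading: "Requirements",
            level: 3,
            content: "Node.js 18 or later is required.",
            codeBlocks: [],
            tables: [{ headers: ["Tool", "Version"], rows: [["Node.js", "18+"]] }],
            subsections: [],
          },
        ],
      },
    ],
  },
};

// Hypothetical helper: checks that a value carries every field a section should have,
// and that all of its nested subsections do too.
function isSection(value) {
  return (
    typeof value === "object" && value !== null &&
    typeof value.id === "string" &&
    typeof value.heading === "string" &&
    Number.isInteger(value.level) && value.level >= 1 && value.level <= 6 &&
    typeof value.content === "string" &&
    Array.isArray(value.codeBlocks) &&
    Array.isArray(value.tables) &&
    Array.isArray(value.subsections) &&
    value.subsections.every(isSection)
  );
}

console.log(exampleOutput.document.sections.every(isSection)); // prints true
```

A check like this can be handy in a content pipeline to catch a malformed file before it reaches rendering code.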

Question 2

How are nested headings handled in the JSON output?

Accepted Answer

SmartMarkdown builds a depth-based nesting hierarchy. H2 headings are top-level sections in the sections array. H3 headings become subsections of the preceding H2. H4 headings become sub-subsections of the preceding H3. The nesting mirrors the document outline — each section object has a subsections array that contains its child sections at the next heading level.
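The nesting rule can be sketched with a stack of open sections. This is a generic illustration of depth-based nesting, not SmartMarkdown's source code; the nestHeadings name and the input shape are assumptions, and attaching a heading that skips a level (for example an H4 directly under an H2) to the nearest shallower heading is a choice made for this sketch.

```javascript
// Sketch: turn a flat, ordered list of headings into nested sections using a stack.
function nestHeadings(headings) {
  const root = [];  // top-level sections
  const stack = []; // chain of currently open ancestor sections

  for (const { level, heading } of headings) {
    const section = { heading, level, subsections: [] };

    // Close every open section that is at the same depth or deeper.
    while (stack.length > 0 && stack[stack.length - 1].level >= level) {
      stack.pop();
    }

    // Attach to the nearest shallower heading, or to the top level if there is none.
    const parent = stack[stack.length - 1];
    (parent ? parent.subsections : root).push(section);
    stack.push(section);
  }
  return root;
}

const outline = nestHeadings([
  { level: 2, heading: "Install" },
  { level: 3, heading: "macOS" },
  { level: 4, heading: "Homebrew" },
  { level: 3, heading: "Linux" },
  { level: 2, heading: "Usage" },
]);

console.log(JSON.stringify(outline, null, 2));
```

Here "Install" ends up with two subsections ("macOS" and "Linux"), "Homebrew" sits under "macOS", and "Usage" is a second top-level section.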

Question 3

How are code blocks represented in the JSON?

Accepted Answer

Each fenced code block within a section is extracted to the codeBlocks array of that section object. A code block object has two fields: language (the language hint after the opening fence, e.g. 'javascript', 'python', or null if no hint was provided) and code (the code content as a string, with newlines preserved). Multiple code blocks in one section produce multiple objects in the codeBlocks array.
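As a rough illustration of how text maps to {language, code} objects, the sketch below scans lines for backtick fences. It is deliberately simplified (no tilde fences, no fences longer than three backticks) and is not how SmartMarkdown itself does it; a real converter would rely on a full Markdown parser. The extractCodeBlocks name is hypothetical.

```javascript
// Three backticks, built programmatically so this example can sit inside a fence itself.
const FENCE = "`".repeat(3);

// Sketch: collect fenced code blocks from Markdown text as { language, code } objects.
function extractCodeBlocks(markdown) {
  const blocks = [];
  let current = null; // block being collected, or null when outside a fence

  for (const line of markdown.split("\n")) {
    if (line.trimStart().startsWith(FENCE)) {
      if (current === null) {
        // Opening fence: anything after the backticks is the language hint.
        const hint = line.trimStart().slice(FENCE.length).trim();
        current = { language: hint === "" ? null : hint, lines: [] };
      } else {
        // Closing fence: join the collected lines, keeping the newlines between them.
        blocks.push({ language: current.language, code: current.lines.join("\n") });
        current = null;
      }
    } else if (current !== null) {
      current.lines.push(line);
    }
  }
  return blocks;
}

const sample = [
  "Intro text.",
  FENCE + "python",
  "print('hi')",
  "print('bye')",
  FENCE,
  FENCE,
  "no hint here",
  FENCE,
].join("\n");

const codeBlocks = extractCodeBlocks(sample);
console.log(codeBlocks);
```

The first block comes out with language "python" and a two-line code string; the second has language null because its opening fence carries no hint.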

Question 4

How are Markdown tables represented in the JSON?

Accepted Answer

GFM pipe tables within a section are extracted to the tables array of that section object. A table object has headers (an array of header cell strings) and rows (a 2D array in which each element is an array of cell strings for one row). Column alignment metadata from the separator row is not included in the current JSON schema; adding it is a planned enhancement.
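The mapping from a pipe table to {headers, rows} can be sketched as follows. This is a simplified parser written for illustration, with hypothetical function names; it does not handle escaped pipes or pipes inside inline code, and it is not SmartMarkdown's implementation.

```javascript
// Split one table line into trimmed cell strings.
function splitRow(line) {
  return line
    .trim()
    .replace(/^\|/, "") // drop the optional leading pipe
    .replace(/\|$/, "") // drop the optional trailing pipe
    .split("|")
    .map((cell) => cell.trim());
}

// Sketch: parse one GFM pipe table (given as text) into { headers, rows }.
function parsePipeTable(tableText) {
  const [headerLine, separatorLine, ...bodyLines] = tableText.trim().split("\n");

  // The separator row (for example |:---|---:|) only carries alignment markers.
  if (separatorLine === undefined || !/^[\s|:-]+$/.test(separatorLine)) {
    throw new Error("Not a pipe table: missing separator row");
  }

  // Alignment is discarded, matching the schema described above.
  return {
    headers: splitRow(headerLine),
    rows: bodyLines.map(splitRow),
  };
}

const table = parsePipeTable(`
| Name  | Type   |
|:------|-------:|
| id    | string |
| level | number |
`);

console.log(table);
```

For this input, headers is ["Name", "Type"] and rows is [["id", "string"], ["level", "number"]].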

Question 5

What metadata fields are included?

Accepted Answer

The metadata object includes: wordCount (total word count of all paragraph text in the document, excluding headings and code), sectionCount (total number of heading-level sections, including nested subsections), generatedAt (ISO 8601 UTC timestamp of when the conversion was performed), and sourceLength (the character count of the Markdown source). These fields are useful for content pipeline validation and monitoring.
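A minimal sketch of how these four fields could be derived is shown below. The buildMetadata and countSections names, and the assumption that paragraph text and sections have already been extracted, are mine; SmartMarkdown's exact counting rules may differ. Note also that a JavaScript string's length counts UTF-16 code units, which can differ from the visible character count for some Unicode text.

```javascript
// Count every section once, plus everything nested beneath it.
function countSections(sections) {
  return sections.reduce((total, s) => total + 1 + countSections(s.subsections), 0);
}

// Sketch: derive the metadata fields from already-extracted pieces.
// `paragraphs` holds paragraph text only (no headings, no code).
function buildMetadata(markdownSource, paragraphs, sections) {
  const wordCount = paragraphs
    .flatMap((text) => text.split(/\s+/))
    .filter((word) => word.length > 0).length;

  return {
    wordCount,
    sectionCount: countSections(sections),
    generatedAt: new Date().toISOString(), // ISO 8601 in UTC (ends in "Z")
    sourceLength: markdownSource.length,
  };
}

const source = "# Doc\n\n## A\n\nHello world.\n\n### B\n\nMore text here.\n";
const meta = buildMetadata(
  source,
  ["Hello world.", "More text here."],
  [{ heading: "A", subsections: [{ heading: "B", subsections: [] }] }]
);

console.log(meta.wordCount, meta.sectionCount); // prints 5 2
```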

Question 6

How does the converter handle very large Markdown documents?

Accepted Answer

SmartMarkdown processes the Markdown synchronously in the browser. For typical documentation files (up to ~500KB of Markdown text), conversion feels instantaneous. Very large documents (multi-megabyte Markdown files with hundreds of sections) may take 1–3 seconds. The JSON output for a large document can also be sizeable: a 100-section document typically produces 50–200KB of JSON, depending on content length.

Question 7

How can I use the JSON output in a Next.js application?

Accepted Answer

Copy the JSON output into a .json file in your project's data directory, then import it in a Next.js page or API route: import content from '@/data/document.json'. The typed structure makes it easy to render sections dynamically, build a search index, or pass the data to an API. For static site generation, you can also run the conversion server-side in a getStaticProps function, using the marked library and a custom JSON serialiser.
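As one example of working with the imported data, the sketch below flattens the nested sections into a list of search-index entries. The flattenSections helper and the entry shape are hypothetical, and a small inline object stands in for the imported file so the snippet runs on its own.

```javascript
// In a Next.js project this would come from the JSON file instead, e.g.
//   import content from '@/data/document.json';
const content = {
  document: {
    title: "Guide",
    sections: [
      {
        id: "setup", heading: "Setup", level: 2, content: "Install it.",
        subsections: [
          { id: "config", heading: "Config", level: 3, content: "Edit the file.", subsections: [] },
        ],
      },
      { id: "usage", heading: "Usage", level: 2, content: "Run it.", subsections: [] },
    ],
  },
};

// Sketch: walk the section tree depth-first and emit one flat entry per section.
// `trail` accumulates the headings of the ancestors for a breadcrumb-style path.
function flattenSections(sections, trail = []) {
  return sections.flatMap((section) => {
    const path = [...trail, section.heading];
    const entry = { id: section.id, path: path.join(" > "), text: section.content };
    return [entry, ...flattenSections(section.subsections, path)];
  });
}

const searchIndex = flattenSections(content.document.sections);
console.log(searchIndex.map((entry) => entry.path));
// prints [ 'Setup', 'Setup > Config', 'Usage' ]
```

The same recursive walk works for rendering: replace the entry object with whatever a page component needs for each section.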

Question 8

How do I update the JSON when the Markdown changes?

Accepted Answer

Re-run the conversion: paste the updated Markdown into SmartMarkdown and download the new JSON. For automated workflows, consider integrating the conversion into your build pipeline. SmartMarkdown uses the marked library (MIT licensed), which you can install via npm and use in a Node.js script to regenerate JSON files as part of your build or CI process.
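The core of such a build script might look like the sketch below. It assumes the token shapes that marked's lexer produces (heading tokens with depth and text, paragraph tokens with text, code tokens with lang and text); verify these against the version of marked you install. The tokensToSections name and the file paths in the comments are hypothetical, nesting is omitted for brevity, and hand-written tokens are used so the snippet runs without any dependencies.

```javascript
// In a real script the tokens would come from marked, for example:
//   import { marked } from "marked";
//   import { readFileSync, writeFileSync } from "node:fs";
//   const tokens = marked.lexer(readFileSync("docs/guide.md", "utf8")); // hypothetical path

// Sketch: group lexer-style tokens into flat section objects.
function tokensToSections(tokens) {
  const sections = [];
  let current = null; // section currently being filled

  for (const token of tokens) {
    if (token.type === "heading") {
      current = { heading: token.text, level: token.depth, content: "", codeBlocks: [] };
      sections.push(current);
    } else if (current === null) {
      continue; // ignore content that appears before the first heading
    } else if (token.type === "paragraph") {
      current.content += (current.content ? " " : "") + token.text;
    } else if (token.type === "code") {
      current.codeBlocks.push({ language: token.lang || null, code: token.text });
    }
  }
  return sections;
}

// Hand-written stand-in for lexer output.
const tokens = [
  { type: "heading", depth: 2, text: "Install" },
  { type: "paragraph", text: "Run the installer." },
  { type: "code", lang: "bash", text: "npm install" },
  { type: "heading", depth: 2, text: "Usage" },
  { type: "paragraph", text: "Import the module." },
];

const json = JSON.stringify({ sections: tokensToSections(tokens) }, null, 2);
console.log(json);
// In the real script: writeFileSync("data/document.json", json); // hypothetical path
```

Wiring a script like this into a prebuild step or a CI job keeps the JSON in sync with the Markdown without manual copy and paste.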
