Firefox Browser Add-ons
  • Extensions
  • Themes
    • for Firefox
    • Dictionaries & Language Packs
    • Other Browser Sites
    • Add-ons for Android
Log in
Add-on icon

Page Content to Markdown version history - 2 versions

Page Content to Markdown by Jared

There are no ratings yet
0 Stars out of 5
5
0
4
0
3
0
2
0
1
0
Page Content to Markdown version history - 2 versions
  • Be careful with old versions! These versions are displayed for testing and reference purposes.You should always use the latest version of an add-on.

  • Latest version

    Version 1.0.1

    Released May 12, 2026 - 119.44 KB
    Works with firefox 109.0 and later
    Fixed
    • General extractor picks the largest matching candidate per selector, not the first. On The Verge, the first <article> on a story page is a related-cards stub — first-match-wins picked it and returned empty markdown. Score every match by textContent.length and pick the largest qualifying candidate.
    • Tighter content-significance threshold. Bump the hasSignificantContent floor to ≥3 <p> descendants and ≥500 chars of trimmed text. Rejects related-card grids that previously slipped through because their aggregated link text passed the old 50-char gate.
    • SVG elements no longer crash Turndown mid-traversal. SVG className is a SVGAnimatedString, not a string; calling .toLowerCase() on it threw and Turndown returned '' for the whole page. Read class via getAttribute('class') throughout the converter, with a fallback to .baseVal for safety. Eliminates a silent empty-output failure mode on news sites that ship inline SVG icons.
    • Visible junk inside the article body no longer ships through. Expanded the non-content substring regex with author-bio, author-card, byline-bio, topics-list, tags-list, tags-row, subscribe, affiliate, disclosure, disclaimer, share-row, share-icons, social-icons, related-articles, related-stories, read-more-cta, keep-reading, frequently-asked, faq-, further-reading, comments-section. Clears author-bio cards on TechCrunch / Tom's Guide, the trailing FAQ section on Mashable, and the end-of-post subscribe widget on Substack.
    • Structural section rejector for related/topics/FAQ/subscribe blocks. Any <section> or <div> whose first heading (looking one level deep through a wrapper div) reads as Topics, Tags, Related…, Frequently Asked…, Further Reading, Read Next, Keep Reading, Recommended, or Subscribe to… gets rejected wholesale, regardless of class names. Catches framework-generated wrappers (mx-auto mt-12, pc-paddingTop-32) that didn't pattern-match before.

    Source code released under MIT License

    Download Firefox and get the extension
    Download file
  • Older versions

    Version 1.0.0

    Released May 8, 2026 - 115.92 KB
    Works with firefox 109.0 and later

    Source code released under MIT License

    Download file
Go to Mozilla's homepage

Add-ons

  • About
  • Firefox Add-ons Blog
  • Extension Workshop
  • Developer Hub
  • Developer Policies
  • Community Blog
  • Forum
  • Report a bug
  • Review Guide

Browsers

  • Desktop
  • Mobile
  • Enterprise

Products

  • Browsers
  • VPN
  • Relay
  • Monitor
  • Pocket
  • Bluesky (@firefox.com)
  • Instagram (Firefox)
  • YouTube (firefoxchannel)
  • Privacy
  • Cookies
  • Legal

Except where otherwise noted, content on this site is licensed under the Creative Commons Attribution Share-Alike License v3.0 or any later version.