Generated on April 05 2026 22:40 PM
Old data? UPDATE !
The score is 56/100
Title
Extract Clean Content from Any Webpage — Trafilatura 🧰
Length : 54
Perfect, your title contains between 10 and 70 characters.
Description
Extract clean, readable content from any website. Uses Trafilatura to strip navigation, ads, and boilerplate. Try it free — no login required. 🔧🛠
Length : 145
Great, your meta description contains between 70 and 160 characters.
Keywords
content extraction,trafilatura,web scraping,text extraction,article extraction
Good, your page contains meta keywords.
Og Meta Properties
Good, your page take advantage of Og Properties.
| Property | Content |
|---|---|
| title | Extract Clean Content from Any Webpage — Trafilatura 🧰 |
| description | Extract clean, readable article text from any web page. Uses Trafilatura to remove navigation, ads, cookie banners, and boilerplate, leaving the main content as plain text or Markdown. Useful for feeding web content to LLMs or archiving articles. Free to try — no login required. 🔧🛠 |
| url | https://www.contextractor.com/ |
| site_name | Contextractor |
| locale | en_US |
| image | https://www.contextractor.com/_next/static/media/opengraph.361f26ff.png |
| image:width | 1200 |
| image:height | 630 |
| image:alt | Extract Clean Content from Any Webpage — Trafilatura |
| type | website |
Headings
| H1 | H2 | H3 | H4 | H5 | H6 |
| 1 | 3 | 3 | 4 | 0 | 0 |
Images
We found 3 images on this web page.
2 alt attributes are empty or missing. Add alternative text so that search engines can better understand the content of your images.
Text/HTML Ratio
Ratio : 0%
This page's ratio of text to HTML code is below 15 percent, this means that your website probably needs more text content.
Flash
Perfect, no Flash content has been detected on this page.
Iframe
Great, there are no Iframes detected on this page.
URL Rewrite
Good. Your links looks friendly!
Underscores in the URLs
Perfect! No underscores detected in your URLs.
In-page links
We found a total of 17 links including 0 link(s) to files
| Anchor | Type | Juice |
|---|---|---|
| What is Contextractor? | Internal | Passing Juice |
| Trafilatura | External | Passing Juice |
| CLI | Internal | Passing Juice |
| Docker | Internal | Passing Juice |
| Apify actor | External | Passing Juice |
| Playground | Internal | Passing Juice |
| What is Trafilatura? | Internal | Passing Juice |
| Apify | External | Passing Juice |
| free tier | External | Passing Juice |
| Creator plan | External | Passing Juice |
| Home | Internal | Passing Juice |
| About | Internal | Passing Juice |
| Press kit | Internal | Passing Juice |
| Library | Internal | Passing Juice |
| GitHub | External | Passing Juice |
| Terms | Internal | Passing Juice |
| Privacy | Internal | Passing Juice |
Keywords Cloud
github terms runs home help kit months apify library press
Keywords Consistency
| Keyword | Content | Title | Keywords | Description | Headings |
|---|---|---|---|---|---|
| apify | 2 | ![]() |
![]() |
![]() |
![]() |
| months | 1 | ![]() |
![]() |
![]() |
![]() |
| terms | 1 | ![]() |
![]() |
![]() |
![]() |
| github | 1 | ![]() |
![]() |
![]() |
![]() |
| help | 1 | ![]() |
![]() |
![]() |
![]() |
Url
Domain : contextractor.com
Length : 17
Favicon
Great, your website has a favicon.
Printability
We could not find a Print-Friendly CSS.
Language
Good. Your declared language is en.
Dublin Core
This page does not take advantage of Dublin Core.
Doctype
HTML 5
Encoding
Perfect. Your declared charset is UTF-8.
W3C Validity
Errors : 0
Warnings : 0
Email Privacy
Great no email address has been found in plain text!
Deprecated HTML
Great! We haven't found deprecated HTML tags in your HTML.
Speed Tips
![]() |
Excellent, your website doesn't use nested tables. |
![]() |
Too bad, your website is using inline styles. |
![]() |
Great, your website has few CSS files. |
![]() |
Too bad, your website has too many JS files (more than 6). |
![]() |
Perfect, your website takes advantage of gzip. |
Mobile Optimization
![]() |
Apple Icon |
![]() |
Meta Viewport Tag |
![]() |
Flash content |
XML Sitemap
Great, your website has an XML sitemap.
| https://www.contextractor.com/sitemap.xml |
Robots.txt
https://contextractor.com/robots.txt
Great, your website has a robots.txt file.
Analytics
Great, your website has an analytics tool.
Google Analytics |
Free SEO Testing Tool is a free SEO tool which provides you content analysis of the website.