webdatacommons.org

Website review webdatacommons.org

Web Data Commons

Generated on March 15 2026 13:43

The score is 54/100

SEO Content

Title

Web Data Commons

Length : 16

Perfect, your title contains between 10 and 70 characters.
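The length check above can be reproduced with a few lines of Python. This is a minimal sketch, not this tool's actual code: the function name `title_length_verdict` is hypothetical, and treating the 10 and 70 character bounds as inclusive is an assumption.

```python
def title_length_verdict(title: str, low: int = 10, high: int = 70) -> str:
    """Classify a page title by its character count.

    The 10..70 range comes from the report; inclusive bounds are an assumption.
    """
    n = len(title)
    if low <= n <= high:
        return "ok"
    return "too short" if n < low else "too long"


title = "Web Data Commons"
length = len(title)                      # character count reported as "Length"
verdict = title_length_verdict(title)    # falls inside the recommended range
```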

Description

Length : 0

Very bad. We haven't found a meta description on your page. Use this free online meta tags generator to create a description.

Keywords

Very bad. We haven't found meta keywords on your page. Use this free online meta tags generator to create keywords.

Og Meta Properties

This page does not take advantage of Og Properties. These tags help social crawlers structure your page better. Use this free og properties generator to create them.
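The description, keywords, and Og checks above all come down to looking for specific `<meta>` tags in the page. The sketch below shows one way to run that audit with Python's standard `html.parser`. The names `MetaCollector` and `audit_meta` are hypothetical, and counting any property that starts with `og:` as Og markup is an assumption about how such a check might work.

```python
from html.parser import HTMLParser


class MetaCollector(HTMLParser):
    """Collect <meta> tags keyed by their name or property attribute."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        key = attributes.get("name") or attributes.get("property")
        if key:
            self.meta[key.lower()] = attributes.get("content") or ""


def audit_meta(html: str) -> dict:
    """Report which of the three meta checks a page would pass."""
    parser = MetaCollector()
    parser.feed(html)
    parser.close()
    return {
        "description": bool(parser.meta.get("description", "").strip()),
        "keywords": bool(parser.meta.get("keywords", "").strip()),
        "og": any(key.startswith("og:") for key in parser.meta),
    }


# A page with only a title fails all three checks.
bare_page = "<head><title>Web Data Commons</title></head>"
bare_result = audit_meta(bare_page)
```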

Headings

H1 H2 H3 H4 H5 H6
1 7 5 0 0 0
  • [H1] Web Data Commons
  • [H2] News
  • [H2] Available Data Sets
  • [H2] Available Software
  • [H2] License
  • [H2] Feedback
  • [H2] About Web Data Commons Project
  • [H2] Credits
  • [H3] RDFa, Microdata, and Microformat
  • [H3] Web Tables
  • [H3] Hyperlink Graph
  • [H3] WebIsA Database
  • [H3] Extraction Framework
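A heading tally like the one above can be produced by counting opening `h1` to `h6` tags. The following is a minimal sketch using the standard library; `HeadingCounter` and `count_headings` are hypothetical names, and the sample markup is a shortened stand-in, not the real page source.

```python
from collections import Counter
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class HeadingCounter(HTMLParser):
    """Count opening heading tags by level."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this handler.
        if tag in HEADING_TAGS:
            self.counts[tag] += 1


def count_headings(html: str) -> dict:
    """Return a count for every level h1..h6, with 0 for unused levels."""
    parser = HeadingCounter()
    parser.feed(html)
    parser.close()
    return {f"h{level}": parser.counts[f"h{level}"] for level in range(1, 7)}


# Shortened, illustrative markup only.
sample = "<h1>Web Data Commons</h1><h2>News</h2><h2>License</h2><h3>Web Tables</h3>"
heading_counts = count_headings(sample)
```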

Images

We found 6 images on this web page.

Good, most or all of your images have alt attributes.

Text/HTML Ratio

Ratio : 58%

Ideal! This page's ratio of text to HTML code is between 25 and 70 percent.
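The report does not say how the ratio is calculated. One common approach is to divide the length of the visible text by the length of the full HTML source, as sketched below. Skipping `script` and `style` content and collapsing whitespace are assumptions, and `TextExtractor` and `text_html_ratio` are hypothetical names.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect text nodes, ignoring the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def text_html_ratio(html: str) -> float:
    """Visible text length as a percentage of total HTML length."""
    if not html:
        return 0.0
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace so indentation does not inflate the text size.
    text = " ".join(" ".join(parser.parts).split())
    return 100.0 * len(text) / len(html)


def ratio_is_ideal(ratio: float) -> bool:
    """Apply the 25..70 percent range quoted in the report."""
    return 25.0 <= ratio <= 70.0
```

Different tools measure this in different ways (bytes versus characters, with or without whitespace), so the result may not match the figure reported here exactly.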

Flash

Perfect, no Flash content has been detected on this page.

Iframe

Great, there are no Iframes detected on this page.

URL Rewrite

Good. Your links look friendly!

Underscores in the URLs

We have detected underscores in your URLs. Use hyphens instead to optimize your SEO.
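To find which links trigger this warning, you can scan each URL for underscores. The sketch below checks only the path component, which is an assumption: this tool may inspect the whole URL. The function name `has_underscore_in_path` is hypothetical and the `example.org` URLs are placeholders, not links from the reviewed page.

```python
from urllib.parse import urlsplit


def has_underscore_in_path(url: str) -> bool:
    """Return True if the URL's path (not its query string) contains '_'."""
    return "_" in urlsplit(url).path


# Placeholder URLs for illustration only.
urls = [
    "https://example.org/web_tables/index.html",
    "https://example.org/web-tables/?q=a_b",
]
flagged = [url for url in urls if has_underscore_in_path(url)]
```

Only the first URL is flagged, because the underscore in the second one sits in the query string rather than the path.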

In-page links

We found a total of 90 links including 12 link(s) to files

Anchor Type Juice
Common Crawl External Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
schema.org class-specific subsets Internal Passing Juice
Schema.org Table Corpus 2023 Internal Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
schema.org class-specific subsets Internal Passing Juice
WDC Block Internal Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
schema.org class-specific subsets Internal Passing Juice
WDC Products Internal Passing Juice
Schema.org Table Annotation Benchmark Internal Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
schema.org class-specific subsets Internal Passing Juice
WDC Product Data Corpus V.2020 Internal Passing Juice
December 2020 WDC schema.org Product and Offer subsets Internal Passing Juice
Schema.org Table Corpus Internal Passing Juice
Improving Hierarchical Product Classification using Domain-specific Language Modelling External Passing Juice
Knowledge Management in e-Commerce workshop External Passing Juice
The Web Conference 2021 External Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
Intermediate Training of BERT for Product Matching External Passing Juice
Version 2.0 Internal Passing Juice
DI2KG workshop External Passing Juice
VLDB2020 External Passing Juice
Using schema.org Annotations for Training and Maintaining Product Matchers External Passing Juice
WIMS2020 External Passing Juice
CfP External Passing Juice
Semantic Web Challenge External Passing Juice
ISWC2020 External Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
Web Tables for Long-Tail Entity Extraction (T4LTE) Internal Passing Juice
Time-Dependent Ground Truth (TDGT) Internal Passing Juice
Using the Semantic Web as a Source of Training Data Internal Passing Juice
Datenbank-Spektrum Journal External Passing Juice
The WDC training dataset and gold standard for large-scale product matching External Passing Juice
ECNLP Workshop External Passing Juice
WWW2019 External Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
WDC Training Dataset and Gold Standard for Large-Scale Product Matching Internal Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
Web Data Integration Framework (WInte.r) External Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
Gold Standard for Product Matching and Product Feature Extraction Internal Passing Juice
RDFa, Microdata, Microformat, and Embedded JSON-LD Internal Passing Juice
web-scale "IsA" database Internal Passing Juice
Profiling the Potential of Web Tables for Augmenting Cross-domain Knowledge Bases External Passing Juice
WWW'16 External Passing Juice
Yahoo GitHub repository External Passing Juice
Yahoo tumblr posting External Passing Juice
WDC Web Table Corpus 2015 Internal Passing Juice
The Graph Structure in the Web - Analyzed on Different Aggregation Levels External Passing Juice
Journal of Web Science External Passing Juice
RDFa, Microdata, and Microformat Internal Passing Juice
T2D Gold Standard Internal Passing Juice
Heuristics for Fixing Common Errors in Deployed schema.org Microdata External Passing Juice
ESWC2015 External Passing Juice
Class-Specific Subsets of the Schema.org Data contained in the Winter 2013 Microdata Corpus Internal Passing Juice
WDC Extraction Framework Internal Passing Juice
guest post External Passing Juice
Hyperlink Graph Dataset Internal Passing Juice
ISWC'14 External Passing Juice
The WebDataCommons Microdata, RDFa and Microformat Dataset Series External Passing Juice
WebSci'14 External Passing Juice
Graph Structure in the Web - Aggregated by Pay-Level Domain External Passing Juice
RDFa, Microdata, and Microformat Internal Passing Juice
WDC Web Tables Internal Passing Juice
First open ranking of the World Wide Web Internal Passing Juice
WDC Hyperlink Graph Internal Passing Juice
Graph Structure in the Web - Revisited External Passing Juice
Integrating Product Data from Websites offering Microdata Markup External Passing Juice
Deployment of RDFa, Microdata, and Microformats on the Web -- A Quantitative Analysis External Passing Juice
types of products that are offered by e-shops using Microdata markup Internal Passing Juice
Details External Passing Juice
RDFa, Microdata, and Microformat Internal Passing Juice
new analysis on vocabulary usage Internal Passing Juice
AWS Summit 2012 Berlin External Passing Juice
Slides External Passing Juice
References Internal Passing Juice
demo web application Internal Passing Juice
first corpus Internal Passing Juice
Amazon EC2 cloud services External Passing Juice
Apache Software License External Passing Juice
Web Data Commons mailing list External Passing Juice
Web Data Commons Google Group External Passing Juice
PlanetData External Passing Juice
LOD2 External Passing Juice
Amazon Web Services in Education Grant Award External Passing Juice
German Research Foundation (DFG) External Passing Juice
ViCe External Passing Juice
Ministry of Economy, Research and Arts of Baden - Württemberg External Passing Juice

SEO Keywords

Keywords Cloud

wdc extracted common web from data microdata released product corpus

Keywords Consistency

Keyword Content Title Keywords Description Headings
data 70
web 54
from 38
corpus 35
wdc 33

Usability

Url

Domain : webdatacommons.org

Length : 18

Favicon

Great, your website has a favicon.

Printability

We could not find a Print-Friendly CSS.

Language

You have not specified the language. Use this free meta tags generator to declare the intended language of your website.

Dublin Core

This page does not take advantage of Dublin Core.

Document

Doctype

HTML 5

Encoding

Perfect. Your declared charset is UTF-8.

W3C Validity

Errors : 0

Warnings : 0

Email Privacy

Great, no email address has been found in plain text!

Deprecated HTML

Great! We haven't found deprecated HTML tags in your HTML.

Speed Tips

Excellent, your website doesn't use nested tables.
Too bad, your website is using inline styles.
Great, your website has few CSS files.
Perfect, your website has few JavaScript files.
Perfect, your website takes advantage of gzip.

Mobile

Mobile Optimization

Apple Icon
Meta Viewport Tag
Flash content

Optimization

XML Sitemap

Great, your website has an XML sitemap.

https://webdatacommons.org/sitemap.xml
http://webdatacommons.org/sitemap.xml

Robots.txt

https://webdatacommons.org/robots.txt

Great, your website has a robots.txt file.

Analytics

Great, your website has an analytics tool.

   Google Analytics
