Internet as Database + API

Big Data of the Web

prev. 34 d
Dbs / Sites Tables Columns Rows Data Bytes Media Bytes Media Items
1,373
(+ 86)
8,304
(+ 463)
51,701
(+ 3,087)
1,622,810,038
(+ 110,607,176)
188.46G
(+ 7.51G)
1.45T
(+ 179.98G)
34,682,953
(+ 3,775,911)
Dbs / Sites 1,373
(+ 86)
Tables 8,304
(+ 463)
Columns 51,701
(+ 3,087)
Rows 1,622,810,038
(+ 110,607,176)
Data Bytes 188.46G
(+ 7.51G)
Media Bytes 1.45T
(+ 179.98G)
Media Items 34,682,953
(+ 3,775,911)
2018-09-13 01:27:17 (7 d ago)

What is DataSN?

Data crawler and parser of all websites


DataSN, or Data Social Network, crawls, parses and hosts all data of the Internet, not raw web pages, but data objects that are both machine friendly and human readable. More than website scraper, DataSN extracts, cleanse, normalize, categorize, and format data.

One account, world's all data


In time, DataSN will become the exchange of all data. For now, we start by collecting the entire Internet for all online data. You need only one membership to access all of them.

Extremely clean and atomic data


DataSN data sets are rigorously sanitized and cleansed. We frown upon unparsed strings and raw bytes, providing atomic data values that are immediately usable by the simplest of programs.

Semantic and meaningful data names


DataSN columns and tables are properly named after the meaning and nature of the data so you instantly know what the data is about.

Easy incremental updates


Every table row is stamped with the time it's created or last updated. You can easily find the newly created or updated data rows since your last retrieval.

Highly normalized and structural


A typical DataSN database or data set consists of multiple tables that are related to each other to form a family, well organized as a network of relations. Data are structurally normalized to reduce redundancy, to facilitate association, categorization, traversing, and searching.

Live API data as soon as being crawled


Data are instantly published via API as soon as they are crawled so your program knows what happens in real world by the minute.

All popular formats


Data should be formatless rather than be bound to a specific application. DataSN data is neutral in formats not affiliated with any proprietary application by delivering the same piece of data in all formats you can imagine, among which the most popular being JSON, CSV, Excel, and HTML. Advanced formats are available per request, such as MySQL, MSSQL, etc.

Easy integration with apps


DataSN's well defined API and solid HTTP infrastructure make it possible to easily and seamlessly integrate the live data streams with your applications to form data pipes or data flows.

Did we mention the media files?


DataSN crawls not just text but also all the media. Media files, such as images, are exhaustively collected, meticulously tagged, categorized and associated with its particular data row(s) so they are searchable and retrievable by information about them.

READY TO TAKE
ACTION?

Sign Up Now!

EARLY BIRDS EMAIL LIST

Subscribe to get notified of promotional offers, official content, and data releases.

Popular Data & APIs
Some of DataSN's most accessed data sets and APIs in the last 14 days

Browse Data + APIs + Images
HTML / JSON / Excel / CSV / PDF / Media / MySQL / MSSQL / WordPress / Magento / ...

Meet the Team!
Find us at #88, Gaoxin Rd., Xi'an, China. Call us at +86 (158) 0293-6510

FROM THE BLOG

Alec Foster

Marketing Program Manager

“A half century after my my grandma began to build relationships with disadvantaged people in her community, we’re seeing a resurgence of people-focused organizing. I’m excited to be a part of it at DataSN.”

FROM THE BLOG

Alec Foster

Marketing Program Manager

“A half century after my my grandma began to build relationships with disadvantaged people in her community, we’re seeing a resurgence of people-focused organizing. I’m excited to be a part of it at DataSN.”

GET DATA OF ANY
WEBSITE

GOT A DIFFERENT IDEA?

Let's Talk

EARLY BIRDS EMAIL LIST

Subscribe to get notified of promotional offers, official content, and data releases.

Terms of Use | Privacy Policy | Disclaimer | support@datasn.io | +86 (158) 0293-6510 | Shangpin Guoji, 88 Gaoxin Road, Xian 710000, China