该文章数据库主要为来自中国网络上67个类别,1,099名作者,1,390,470篇包括金融股市、保险债券、基金投资等方向的文章。除此之外,还有10,534个问答和2,346,796文章评论。
This is a fields-rich finance, economics and stock articles database having 1,390,470 records with content, author id, create timestample, retweet count, reply count, praise count, create time, etc. in each. All these articles are written by 1,099 authors in 67 categories. It contians 10,534 answers and 2,346,796 article comment records for these articles. There are 684,413 article images in the 30.96G media set. The whole finance, economics, stock articles and answers database totally has 14 tables.
Name | n3_lyz_xueqiu.com_people |
---|---|
Data | 5.20G (+ 0B) |
Tables | 14 (+ 0) |
Columns | 81 (+ 0) |
Table Rows | 7,852,776 (+ 0) |
Media | 30.96G (+ 0B) |
Files | 413,364 (+ 0) |
Tables & Columns
Tables | Rows | Columns | Non-empty |
---|---|---|---|
anwser | 10,534 | answer_timestampe |
100%
|
anwser |
100%
|
||
author_id |
100%
|
||
anwser_x_article | 10,557 | anwser_id |
100%
|
article_id |
100%
|
||
article | 1,390,470 | article_title |
17.73%
|
content |
64.71%
|
||
author_id |
100%
|
||
create_timestample |
65.06%
|
||
retweet_count |
65.06%
|
||
reply_count |
65.06%
|
||
praise_count |
65.06%
|
||
create_time |
59.97%
|
||
update_time |
0%
|
||
article_comment | 2,346,796 | comment_idt |
100%
|
comment_content |
100%
|
||
article_id |
100%
|
||
comment_timestample |
100%
|
||
article_image_slug | 684,413 | article_id |
100%
|
article_image_slug_c | 684,413 | article_id |
100%
|
author | 1,099 | screen_name |
100%
|
description |
100%
|
||
province |
86.17%
|
||
city |
86.17%
|
||
location |
0%
|
||
fucos_count |
100%
|
||
fans_count |
100%
|
||
authentication |
29.48%
|
||
feature |
4.73%
|
||
category | 67 | category |
100%
|
category_x_author | 1,818 | category_id |
100%
|
author_id |
100%
|
||
comment_x_reply_remark | 1,019,086 | article_comment_id |
100%
|
reply_remark_id |
100%
|
||
reply_remark | 752,107 | comment_idt |
100%
|
comment_content |
98.93%
|
||
comment_timestample |
100%
|
||
retweet | 475,269 | retweeted_timestample |
100%
|
retweeted_text |
100%
|
||
author_id |
100%
|
||
retweet_x_article | 476,147 | retweet_id |
100%
|
article_id |
100%
|
Notes:
- Meta columns id and ts are not shown here, but included in columns total.
- Some statistical or memo columns are not shown here, but included in columns total.
- Non-empty percentage indicates the percent of values that are non-empty for that particular column.
Media Sets
ARTICLE_IMAGE
Size (Bytes) | Files |
---|---|
30.96G | 413,364 |