数据挖掘_Epinions datats(Epinions数据集)

更新时间:2023-06-29 04:48:49 阅读: 评论:0

Epinions datats(Epinions数据集)
数据摘要:
it contains the ratings given by urs to items and the trust statements issued by urs.
中文关键词:
Epinions,数据集,信息,信任度,等级,
英文关键词:
Epinions,datats,information,trust metrics,ratings,
数据格式:仰慕的意思
TEXT
数据用途:
Social Network Analysis
Information Processing
Classification
数据详细介绍:
Epinions datats
The datat was collected by Paolo Massa in a 5-week crawl (November/December 2003) from Web site.
The datat contains
49,290 urs who rated a total of
139,738 different items at least once, writing
成都大学医护学院
664,824 reviews.
487,181 issued trust statements.
Urs and Items are reprented by anonimized numeric identifiers.
The datat consists of 2 files.
Contents
1 Files
1.1 Ratings data
1.2 Trust data
1.3 Data collection procedure
2 Papers analyzing Epinions datat
Ratings data
bz2 (2.5 Megabytes): it contains the ratings given by urs to items. Every line has the following format:
ur_id item_id rating_value世界大学排名前十
For example,
23 387 5
reprents the fact "ur 23 has rated item 387 as 5"
Ranges:
七夕散文
ur_id is in [1,49290]
item_id is in [1,139738]
rating_value is in [1,5]
Trust data
bz2 (1.7 Megabytes): it contains the trust statements issued by urs. Every line has the following format:
source_ur_id target_ur_id trust_statement_value
For example, the line立春祝福短信
22605 18420 1
reprents the fact "ur 22605 has expresd a positive trust statement on ur 18420"
Ranges:
source_ur_id and target_ur_id are in [1,49290]
trust_statement_value is always 1 (since in the datat there are only positive trust statements and not negative ones (distrust)).
花儿朵朵Note: there are no distrust statements in the datat (block list) but only trust statements (web of trust), becau the block list is kept private and not shown on the site.
Data collection procedure
The data were collected using a crawler, written in Perl.
It was the first program I (Paolo Massa) ever wrote in Perl (and an excu for learning Perl) so the code is probably very ugly. Anyway I relea the code under the GNU Generic Public Licence (GPL) so that other people might be u the code if they so wish.
is the version I ud, this version pars the HTML and saves minimal information as perl objects. Later on, I saw this was not a wi choice (for example, I didn't save demographic information about urs which might have been uful for testing, for example, is urs trusted by ur A comes from the same city or region). So later on I created a version that saves the original HTML pages
(epinionsRobot_) but I didn't test it. Feel free to let me know if it works. Both Perl files are relead under GNU Generic Public Licence (GPL), e first lines of the files. --PaoloMassa
酒店运营方案Be aware that the script was working in 2003, I didn't check but it is very likely that the format of HTML pages has changed significantly in the meantime so the script might need
some adjustments. Luckily, the code is relead as open source so you can modify it. --Paolo Massa 11:34, 16 July 2010 (UTC)印章公司
Papers analyzing Epinions datat
Trust-aware Recommender Systems
add another paper!
Retrieved from "ustlet/wiki/Downloaded_Epinions_datat"
数据预览:

本文发布于:2023-06-29 04:48:49,感谢您对本站的认可!

本文链接:https://www.wtabcd.cn/fanwen/fan/89/1059668.html

版权声明:本站内容均来自互联网,仅供演示用,请勿用于商业和其他非法用途。如果侵犯了您的权益请与我们联系,我们将在24小时内删除。

上一篇:旅费证
标签:数据   信息   成都
相关文章
留言与评论(共有 0 条评论)
   
验证码:
推荐文章
排行榜
Copyright ©2019-2022 Comsenz Inc.Powered by © 专利检索| 网站地图