Work Description
Title: English WikiProject coeditor networks and quality assessments Open Access Deposited
Attribute | Value |
---|---|
Methodology |
|
Description |
|
Creator | |
Depositor |
|
Contact information | |
Discipline | |
Funding agency |
|
ORSP grant number |
|
Keyword | |
Citations to related material |
|
Resource type | |
Last modified |
|
Published |
|
Language | |
DOI |
|
License |
(2018). English WikiProject coeditor networks and quality assessments [Data set], University of Michigan - Deep Blue Data. https://doi.org/10.7302/Z2610XJB
Relationships
- This work is not a member of any user collections.
Files (Count: 5; Size: 68.3 GB)
Thumbnailthumbnail-column | Title | Original Upload | Last Modified | File Size | Access | Actions |
---|---|---|---|---|---|---|
![]() |
Readme.md | 2018-10-11 | 2018-10-11 | 2.29 KB | Open Access |
|
![]() |
agent_based_model_code_git.tgz | 2018-06-25 | 2018-12-18 | 1.41 MB | Open Access |
|
![]() |
coeditor_networks.tgz | 2018-06-25 | 2018-12-18 | 35.4 GB | Open Access |
|
![]() |
wikiproject_code_git.tgz | 2018-06-25 | 2018-12-18 | 2.67 MB | Open Access |
|
![]() |
wikiproject_data.tgz | 2018-06-25 | 2018-12-18 | 32.8 GB | Open Access |
|
This archive contains data and code for the analysis presented in
(Platt & Romero, 2018). This study analyzed relationships between structural
properties of WikiProject co-editor networks and the performance/efficiency
of those WikiProjects. Co-editor networks were constructed from the entire
Wikipedia edit history by creating an edge between two editors if they had
edited the same article. The performance and efficiency of a WikiProject was
determined from the history of WikiProject article quality assessments.
In addition to the co-editor network, the project included agent-based model
simulations, which did not rely on any empirical data. The code for these
simulations is included in agent_based_model_code_git.tgz
.
The code and data for the empirical WikiProject analysis are contained in
the wikiproject_code_git.tgz
and wikiproject_data.tgz
files respectively.
For convenience, the co-editor networks are also included by themselves in
the coeditor_networks.tgz
file, using adjacency list format.
Reproducing the analysis also relies on external data sources that
have been archived elsewhere, linked below.
The data and code is organized into several directories, each containing a
Readme.md file with adittional information.
Contents
agent_based_model_code_git.tgz
: git repository containing code for agent-based models. This repository includes a copy of thelogbook
module released under the 3-clause BSD license.coeditor_networks.tgz
: English-lanaguage WikiProject coeditor networks in adjacency-list format. Node ids are Wikipedia editor ids. WikiProject ids are mapped to titles inwikiproject_data.tgz
.wikiproject_code_git.tgz
: git repository containing code for empirical analysis of performance and efficiency of WikiProject co-editor networks.wikiproject_data.tgz
: Data sets used by empirical analysis scripts.
External data
- Wikipedia contributors. (2015). Wikimedia database dump of the English Wikipedia on December 01, 2015. The Internet Archive https://archive.org/details/enwiki-20151201
References
- Platt, E. L. & Romero, D. M. (2018). Network Structure, Efficiency, and Performance on WikiProjects. In ICWSM.
Data citation
- Platt, E. L., Livneh, D., Ramanathan, K., and Romero, D. M. (2018). English WikiProject coeditor networks and quality assessments.