Publication Date
2009
Document Type
Honors Project
Degree Name
Bachelor of Arts
Department
Computer Science
Abstract
We present a visualization tool that lets users explore hidden information in Wikipedia. Users can discover trends and relationships among pages and the contributors who revise them. The visualizer is written in Java using Prefuse, a visualization toolkit for Java that lets developers create almost any 2D graph they want. The final visualizer shows the relationships between pages and contributors. We also measured performance, including execution times for our filtering code and memory allocation for several page counts. From these results we predict that processing the complete set of 9 million pages and their contributors would take approximately 1.64 years, and that storing all of the important information would require slightly more than 0.7 terabytes of space. The visualizer is a unique and solid starting toolkit that can be expanded in future implementations.
Recommended Citation
Grascia, Christine M., "2D visualizer for the wikipedia database" (2009). Honors Project, Smith College, Northampton, MA.
https://scholarworks.smith.edu/theses/1452
Comments
1 v. (various pagings) : ill. (chiefly col.) Honors project--Smith College, Northampton, Mass., 2009. Includes bibliographical references.