Facilitating Analysis of Big Data on Reddit via an Easy to Use Visualisation Tool (bibtex)
by Goncalves, Jorge, Klakegg, Simon, van Berkel, Niels and Hosio, Simo
Abstract:
With the rapid proliferation of social media sites, researchers have increasingly turned to data generated from these platforms to investigate human behaviour. In this paper we report the design and implementation of the RDV (Reddit Data Visualisation) platform, a visualisation tool aimed at facilitating the analysis of a publicly available Reddit dataset, which contains  1.7 billion JSON objects collected from October 2007 to October 2015. RDV allows for researchers without advanced coding skills to easily analyse this dataset, while also providing a tailor-made platform to account for the intricacies of any dataset originating from Reddit. We showcase the features of the platform through an example of data analysis using the Reddit dataset: the 2015 United Kingdom general elections. Finally, we conclude by discussing the need for better and simpler visualisation tools for non-technical researchers to analyse Big Online Behavioural Datasets, and report our ongoing work in this area.
Reference:
J. Goncalves, S. Klakegg, N. van Berkel, S. Hosio, "Facilitating Analysis of Big Data on Reddit via an Easy to Use Visualisation Tool", in Proceedings of the British Human Computer Interaction Conference (British HCI'18), 2018, 1-6.
Bibtex Entry:
@inproceedings{Goncalves2018BigDataReddit,
	Abstract = {With the rapid proliferation of social media sites, researchers have increasingly turned to data generated from these platforms to investigate human behaviour. In this paper we report the design and implementation of the RDV (Reddit Data Visualisation) platform, a visualisation tool aimed at facilitating the analysis of a publicly available Reddit dataset, which contains ~1.7 billion JSON objects collected from October 2007 to October 2015. RDV allows for researchers without advanced coding skills to easily analyse this dataset, while also providing a tailor-made platform to account for the intricacies of any dataset originating from Reddit. We showcase the features of the platform through an example of data analysis using the Reddit dataset: the 2015 United Kingdom general elections. Finally, we conclude by discussing the need for better and simpler visualisation tools for non-technical researchers to analyse Big Online Behavioural Datasets, and report our ongoing work in this area.},
	Author = {Goncalves, Jorge and Klakegg, Simon and van Berkel, Niels and Hosio, Simo},
	Booktitle = {Proceedings of the British Human Computer Interaction Conference},
	Location = {British HCI'18},
	Pages = {1-6},
	Title = {Facilitating Analysis of Big Data on Reddit via an Easy to Use Visualisation Tool},
	Type = {Conference Paper},
	Url = {https://nielsvanberkel.com/files/publications/bhci2018a.pdf},
	Year = {2018}}
Powered by bibtexbrowser