Publications

Scalable Query and Analysis for Social Networks

Abstract

E very day, vast amounts of data are being collected from social network (e.g., Twitter) applications, and in response there is a growing need foranalysis methods that can handle this terabyte-size input. To provide an effective and advanced data processing environment for various types of social data analysis such as political discourses, trending topics, evolution of user behavior, social bots detection and orchestrated campaigns, we need to support both query and complex analysis efficiently. Use of high-level scripting languages to solve big data problems has become a mainstream approach for sophisticated data mining and analysis. In particular, high-level interfaces such as Pig, Hive, and Spark SQL are being used on top of the Hadoop framework. This simplifies coding of complex tasks in MapReduce-style systems while improving the flexibility of database systems through user-defined aggregations. In this …

Date
2016
Authors
Tak-Lon Stephen Wu, Bingjing Zhang, Emilio Clayton Davis, Alessandro Flammini Ferrara, Filippo Menczer, Judy Qiu
Book
Big Data in Complex and Social Networks
Pages
47-72
Publisher
Chapman and Hall/CRC