The proliferation of 5G and beyond facilitates the advancement of next-generation technologies, including smart cities, self-driving cars, online video gaming, virtual reality, and augmented reality. This necessitates a re-evaluation of how these services are characterized and deployed. Serverless computing is an emerging paradigm, referring to a software architecture where an application is decomposed into triggers (also called events) and actions (also called functions), and there is a platform that provides seamless hosting and execution environment, making it easy to develop, manage, scale, and operate them. Read More
Today many big data analytics applications (e.g., fraud detection, social data analytics, education, climate modeling, epidemiology, and finance) need to process enormous datasets from geographically distributed locations. An emerging trend is to host these big data analytics applications in the public cloud. They can be packaged to run in a lightweight isolated execution environment (containers) and deployed on computing resources rented from public cloud providers, which can be updated and scaled seamlessly. However, the complex inter-container correlations and the heterogeneity of hardware resources pose significant challenges in managing these big data analytics applications in the public cloud. This project enables the easy deployment of containerized big data analytics applications in the public cloud and provides cloud providers with insights to better tune their systems for current and future big data workloads. Read More
Internet-of-Things (IoT) applications such as self-driving cars, augmented reality, interactive gaming, and event monitoring have a tremendous potential to improve our lives. These applications generate a large influx of sensor data at massive scales. Under many time-critical scenarios, these massive data streams must be processed in a very short time to derive actionable intelligence. This CAREER project aims to support time-critical IoT applications by applying the stream processing paradigm to the Edge computing architecture in the dynamic, heterogeneous Edge environment. As an integral part of its research program, this CAREER project involves K-12, undergraduate and graduate level education in partnership with the local Public School system. Read More
Selected Publications: ICS'22, USENIX ATC'21
Our society increasingly relies on applications that process streaming data across geo-distributed sites, such as making business decisions from marketing data, identifying spam campaigns in social network streams, and analyzing genome datasets in different labs and countries to track the sources of potential epidemics. Existing systems are mainly designed for stateless stream processing in intra-datacenter settings and do not scale well for running stream applications that contain large distributed states. This project breaks the traditional abstractions of a centralized architecture and hashtable-based stateless operators, redefining them with a new decentralized architecture and new memory-efficient stateful operators, which enables novel approaches to improve overall system performance and scalability. Read More
Selected Publications: IEEE Access, Middleware'20, IPDPS'20, ICAC'18, USENIX ATC'14
The broad success of online social networks (OSNs) has created fertile soil for the emergence and fast spread of rumors. Rumors are damaging as they cause public panic and social unrest. There is an urgent need to quickly detect false rumors (e.g., fake news) circulated on social online networks early in its propagation before it reaches a broad audience. The goal of this project is to develop a next-generation online scalable streaming system for early rumor detection in OSNs. Read More
Selected Publications: Cloud'19 Best student paper, Cloud'18