Big data processing has been extensively studied in both academia and industry. Various frameworks and distributed processing engines have been developed for general big data processing, such as Hadoop, Spark, and Flink. This project explores implementation details of such distributed engines and designs general techniques for processing big complex streaming data (e.g., graph).
Big data processing | Database | Data mining | Graph analysis | Data structure and algorithms
The research will be conducted in the DKR group (https://unswdb.github.io/). We manage a cluster of servers for experiments of large-scale distributed data. Further HDR positions (MPhil/PhD) and scholarships are available for excellent project students.
Design and implement distributed algorithms for typical problems (e.g., link prediction).