MR3 is a new execution engine for Hadoop. Similar in spirit to Tez, it offers a simpler design, better performance, and more features. MR3 is ready for production use, as it supports all major features such as Kerberos-based security, authentication and authorization, fault tolerance, and recovery. MR3 is implemented in Scala.
Hive on MR3. Hive, the de facto standard for SQL queries in Hadoop, currently supports three execution engines for its backend -- MapReduce, Tez, and Spark. Now Hive can run on top of MR3 as well. Hive on MR3 generally runs faster than Hive on Tez by virtue of the simple architectural design of MR3. In particular, it yields higher throughput for concurrent queries by making better use of computing resources. Hive 2 and 3 also support an execution mode called LLAP (Low Latency Analytical Processing), designed for interactive queries. In comparison with Hive with LLAP, Hive on MR3 allows elastic allocation of cluster resources, provides better support for concurrency, and fully implements impersonation.
We are hiring developers who will contribute to the MR3 project. Prior experience with Scala programming or Hadoop is not required, but applicants should have working experience with Linux-based distributed Java programming. Applicants are invited to contact Professor Sungwoo Park (gla at postech.ac.kr). We communicate in English for technical discussions, so applicants should have a working command of English.
We are recruiting students to take part in MR3 research. Any student interested in big data, Hadoop, or software development is welcome. MR3 is developed in Scala.
Programming Language Laboratory
Department of Computer Science and Engineering
Pohang University of Science and Technology
San 31 Hyoja-dong, Nam-gu, Pohang, Gyeongbuk, 790-784
Republic of Korea
Office: PIRL, 354
Web page maintained by Sungwoo Park