What is Hivemall?
- Scalable Machine Learning
- A collection of hive UDFs
- ML on SQL
Why Hivemall?
- Scalable (compare to scikit-learn)
- Easy To Use (compare to scikit-learn)
Introduction
https://www.slideshare.net/myui/introduction-to-hivemall
Extended: top open sources big data tools 2016
- Spark
- Beam
- TensorFlow
- Solr
- ElasticSearch
- SlamData
- Impala
- Kylin
- Kafka
- StreamSets
- Titan
- Zeppelin