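2017-11-17
spark-streaming

spark streaming

k-means
decay (forgetfulness)
mini-batch k-means

$$c_{t+1} = \frac{c_t \, n_t \, a + x_t \, m_t}{n_t + m_t}$$

$$n_{t+1} = n_t \, a + m_t$$

Here c_t is the previous center, n_t the number of points assigned to the cluster so far, x_t the center computed from the current batch, m_t the number of points the current batch adds to the cluster, and a the decay factor.

Broadcast Variables

ref
https://databricks.com/blog/2015/01/28/introducing-streaming-k-means-in-spark-1-2.html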
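
To make the decay (forgetfulness) update in the k-means notes concrete, here is a minimal plain-Python sketch of one update step for a single cluster. It does not use Spark; the function name `update_center` and its parameters are hypothetical and chosen only for illustration. One assumption to flag: the sketch divides by the decayed weight n_t * a + m_t (the same quantity the notes assign to the next n), so that the new center is a proper weighted average. That differs from the n_t + m_t denominator as written in the notes, and the two agree only when a = 1.

```python
from typing import List, Tuple


def update_center(
    c_t: List[float],
    n_t: float,
    x_t: List[float],
    m_t: int,
    a: float = 1.0,
) -> Tuple[List[float], float]:
    """Apply one decayed mini-batch update to a single cluster (illustrative).

    c_t: current center
    n_t: weight (point count) accumulated so far
    x_t: mean of the points assigned to this cluster in the new batch
    m_t: number of those points
    a:   decay factor in [0, 1]; 1 keeps all history, 0 forgets it
    """
    decayed = n_t * a        # old weight after forgetting
    n_next = decayed + m_t   # next weight: n_t * a + m_t
    if n_next == 0:
        # No remaining old weight and no new points: keep the center as is.
        return list(c_t), 0.0
    # Assumption: normalize by the decayed weight, not by n_t + m_t.
    c_next = [(c * decayed + x * m_t) / n_next for c, x in zip(c_t, x_t)]
    return c_next, n_next


if __name__ == "__main__":
    # Same batch, three decay factors: the smaller a is, the more the
    # new batch mean dominates the updated center.
    for a in (1.0, 0.5, 0.0):
        print(a, update_center([0.0, 0.0], 10, [1.0, 1.0], 10, a=a))
```

With a = 1 the old center and the new batch are weighted by their point counts, as in ordinary mini-batch k-means; with a = 0 the updated center is just the mean of the current batch.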