You’d be forgiven for passing by the announcement of Apache Spark 2.3. After all, it’s a point release, isn’t it? Sure, there will be some bug fixes, maybe an improvement or two to the MLLib framework, maybe an extra operator or something, but nothing all that major. That will be saved for 3.0, surely?

In fact, this is no mere point release. Apache Spark 2.3 ships with two major new features, one of which is perhaps the biggest (and often-requested) to operations since Spark was added to the project. The other is native integration with to execute Spark jobs in container clusters.

To read this article in full, please click here



Source link
Bigdata and center
thanks you RSS link
( https://www.itworld.com/article/3262995/spark/-new-in-apache-spark-low-latency-streaming-and-kubernetes.html#tk.rss_bigdata)

LEAVE A REPLY

Please enter your comment!
Please enter your name here