Using PySpark
Apache Spark provides APIs in non-JVM languages such as Python. Many data scientists use Python because it has a rich variety of numerical libraries with a statistical, machine-learning, or optimization focus.
Continue reading:
Page generated July 8, 2016.
<< Running Spark Applications on YARN | ©2016 Cloudera, Inc. All rights reserved | Running Spark Python Applications >> |
Terms and Conditions Privacy Policy |