teradatamlspk - Teradata Python package for running Spark workloads on Vantage
Details
Overview
teradatamlspk is a Python package built as an extension of teradataml, the Teradata Python package. The syntax and usage of teradatamlspk APIs are kept similar to the PySpark APIs, so existing PySpark workloads that run on a Spark engine can be migrated to and run on Teradata Vantage with minimal changes.
teradatamlspk also offers a function, pyspark2teradataml, that converts a PySpark script to a teradatamlspk Python script. It also generates an HTML report for the conversion, which helps the user understand the changes made and carry out any manual changes still needed in the generated script so that it can be run on Vantage.
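As a rough illustration of the conversion step described above, the sketch below wraps the call in a small helper. The helper name convert_pyspark_script is hypothetical, and the import path and the single script-path argument for pyspark2teradataml are assumptions that should be checked against the teradatamlspk user guide. This page does not state where the converted script and the HTML report are written.

```python
from pathlib import Path


def convert_pyspark_script(script_path):
    """Hypothetical helper: convert one PySpark script with pyspark2teradataml."""
    path = Path(script_path)
    if not path.is_file():
        # Fail early with a clear message before touching the package.
        raise FileNotFoundError(f"PySpark script not found: {path}")

    # Assumption: pyspark2teradataml is importable from the top-level
    # teradatamlspk package and accepts the script path as its argument.
    # The import is done here so this block loads even when teradatamlspk
    # is not installed.
    from teradatamlspk import pyspark2teradataml

    pyspark2teradataml(str(path))
```

After the conversion, review the HTML report and apply any manual changes it calls for before running the generated script on Vantage.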
Dependent Python Packages:
- teradataml >= 20.00.00.03
- PrettyTable
- nbformat
- pytz
Prerequisite: Python >= 3.9.0 on the client machine
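The requirements above can be checked on a client machine before installing. The following is a minimal sketch that uses only the Python standard library. The distribution names are taken from the list above, their exact spellings on the package index are an assumption, and the helper names are illustrative.

```python
import sys
from importlib import metadata

# Names as listed on this page; exact index spellings are an assumption.
REQUIRED = ["teradataml", "prettytable", "nbformat", "pytz"]
MIN_PYTHON = (3, 9, 0)
MIN_TERADATAML = "20.00.00.03"


def python_ok(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the interpreter version meets the stated minimum."""
    return tuple(version_info[:3]) >= minimum


def version_tuple(text):
    """Parse a purely numeric dotted version, e.g. '20.00.00.03' -> (20, 0, 0, 3)."""
    return tuple(int(part) for part in text.split("."))


def installed_versions(names=REQUIRED):
    """Map each package name to its installed version, or None if missing."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    print("Python >= 3.9.0:", python_ok())
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

A reported teradataml version can then be compared against the minimum with version_tuple(version) >= version_tuple(MIN_TERADATAML), provided the version string is purely numeric.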
Download Teradata Vantage Express, a free, fully functional Teradata Vantage database that can be up and running on your system in minutes. Please download and read the user guide for installation instructions.
Note that in order to run this VM, you'll need to install VMware Workstation Player, VMware Fusion, VMware Server, VirtualBox, or UTM on your system. For more details, see our getting started guides.
For feedback, discussion, and community support, please visit the Cloud Computing forum.