Data Science UDF Examples

This section contains tutorials and examples that show how to use UDFs for data science.

The tutorials demonstrate what UDFs in Exasol can do for data science. You do not need in-depth knowledge of data science or of the R and Python programming languages to follow them.

Exasol offers a 30-day free trial of a public demo system hosted in our own cloud, which contains preloaded datasets and a guided tutorial. The demo system is a static shared system and does not provide the full functionality of Exasol. The tutorials in this section are designed to work with the public demo system.

To get access to the free public demo system, sign up here.

User-defined functions (UDFs)

User-defined functions (UDFs) let you write your own analysis, processing, or generation functions and execute them in parallel within an Exasol cluster. UDFs are classified by their input and output specifications:

Input types:

  • SCALAR

    Single input

  • SET

    Multiple rows as input

Output types:

  • RETURNS

    Single output

  • EMITS

    Multiple output rows

Combinations of input and output types:

  • SCALAR input with RETURNS output: Function

    Executed in parallel. Examples: my_maximum, json_value

  • SET input with RETURNS output: Aggregation function

    Not executed in parallel except for GROUP BY. Examples: my_average

  • SCALAR input with EMITS output: Generator function / MapReduce / ETL UDFs

    Executed in parallel. Examples: IMPORT_METADATA (Cloud Storage Extension)

  • SET input with EMITS output: Analytical function

    Not executed in parallel except for GROUP BY. Examples: IDA_TEST_MODEL (Python Classification Tutorial), FLIGHTS_PRED_DEP_DELAY (R Regression Tutorial)

For more details about how to use UDFs, see UDF Scripts.
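
The four input/output combinations described above can be sketched in plain Python. This is a minimal illustration that runs outside the database, not the implementation of the UDFs named in this section. It assumes that each Python UDF body is a function that receives a context object exposing the input columns as attributes, a next() method that advances to the next input row, and an emit() method that writes an output row. FakeContext, call_scalar, split_words, and running_total are hypothetical names introduced only for this sketch; in Exasol, each function body would be a separate script defined with CREATE SCRIPT.

```python
class FakeContext:
    """Hypothetical stand-in for the context object passed to a UDF body."""

    def __init__(self, columns, rows):
        self._columns = list(columns)
        self._rows = list(rows)
        self._pos = 0
        self.emitted = []  # rows written by emit(), used by EMITS scripts

    def __getattr__(self, name):
        # Reached only when normal lookup fails: treat the name as a column.
        if name.startswith("_") or name not in self._columns:
            raise AttributeError(name)
        return self._rows[self._pos][self._columns.index(name)]

    def next(self):
        # Advance to the next input row; return False when none is left.
        if self._pos + 1 < len(self._rows):
            self._pos += 1
            return True
        return False

    def emit(self, *values):
        self.emitted.append(values)


def my_maximum(ctx):
    # SCALAR input, RETURNS output: one row in, one value out.
    return max(ctx.a, ctx.b)


def my_average(ctx):
    # SET input, RETURNS output: many rows in, one value out.
    total, count = 0.0, 0
    while True:
        total += ctx.x
        count += 1
        if not ctx.next():
            break
    return total / count


def split_words(ctx):
    # SCALAR input, EMITS output: one row in, many rows out (hypothetical example).
    for word in ctx.line.split():
        ctx.emit(word)


def running_total(ctx):
    # SET input, EMITS output: many rows in, many rows out (hypothetical example).
    total = 0.0
    while True:
        total += ctx.x
        ctx.emit(ctx.x, total)
        if not ctx.next():
            break


def call_scalar(func, columns, rows):
    # A SCALAR script sees one row per call, so each row gets its own context.
    return [func(FakeContext(columns, [row])) for row in rows]
```

Because a SCALAR script only ever sees one row per call, the calls are independent of each other, which is what allows them to be distributed across the cluster. A SET script must see all rows of its group together, which matches the note above that SET scripts are parallelized only across GROUP BY groups.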

UDF tutorials

The following articles in this section contain tutorials for different use cases:

  • Python Classification Tutorial

    Learn how UDFs can be used in a machine learning or data science context using Python. This tutorial shows how to test the accuracy of a model, either from a SQL client or directly from a familiar Python environment such as Jupyter Notebook.

  • R Regression Tutorial

    Learn how UDFs can be used in a machine learning or data science context using R. The FLIGHTS_PRED_DEP_DELAY UDF listed above comes from this tutorial.

  • Geocoding with UDFs

    Learn how to use Exasol to add geocodes to your data.

  • Train and Predict with R through UDFs

    Learn how to use Exasol with R to train a model and make predictions using UDFs.