Sample

Sample offers the ability to view and use a subset of your dataset. You can select this subset of your data using any of the options below.

Input/Output

Inputs
Outputs

Select Rules - rules to choose from for sample

Value, N - N value in rules descriptions (number)

Group By (optional) - column(s) to group by in table. N rows are returned for each group

Table with a subset of the data from your original input dataset. You can select this subset of your data using any of the options below.

Options

Option
Description

Random N rows

Returns a randomized set containing N rows of data. Set is re-randomized each time the workflow is run.

First N rows

Returns every row in the data from the beginning of the data through row N.

Last N rows

Returns the last N rows of the data.

Skip first N rows

Returns all rows in the data starting after row N.

1 every N rows

Returns the first row of every group of N rows.

First N% rows

Returns N percent of rows.

You also have the option to use a groupby that will apply the above rules to each group or rows in the groupby.

Parameters

You are also capable of dictating N value via parameter.

Example

Let's say we have a list of baseball players that we want to use a sample of:

We only want to see the first 5 players in the list. So we use the First N rows and set N = 5We get the following result:

Example 2

Now we want to only use the first 20% of emails in the list. So we use the First N% rows and set N = 20We get the following result:

Example 3

Now we want to only use the first 20% of emails in the list using a parameter. We first create a parameter called Sample. We then use the First N% rows and set N = {param!Sample}We get the following result:

Last updated

Was this helpful?