Sample

Last updated 9 months ago

Sample lets you view and work with a subset of your dataset. Choose how the subset is selected with one of the options below.

Input/Output

Inputs

  • Select Rules - the rule used to select the sample (see Options)
  • Value, N - the number N referenced in the rule descriptions
  • Group By (optional) - column(s) to group the table by; N rows are returned for each group

Outputs

  • Table containing a subset of the rows from your original input dataset, selected by the chosen rule

Options

  • Random N rows - Returns a randomized set containing N rows of data. The set is re-randomized each time the workflow is run.
  • First N rows - Returns every row from the beginning of the data through row N.
  • Last N rows - Returns the last N rows of the data.
  • Skip first N rows - Returns all rows in the data starting after row N.
  • 1 every N rows - Returns the first row of every group of N rows.
  • First N% rows - Returns the first N percent of rows.
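The rules above can be sketched in plain Python to make their semantics concrete. This is an illustration under stated assumptions, not the tool's implementation: the function name `sample_rows` and the rule keys are hypothetical, rows are modeled as a simple list, and rounding the percentage rule down to a whole number of rows is an assumption.

```python
import math
import random


def sample_rows(rows, rule, n, seed=None):
    """Return a subset of rows for one of the Sample rules.

    Hypothetical helper for illustration; rule keys mirror the option labels.
    """
    if rule == "random_n":
        # Random N rows: a different subset on every call unless a seed is given.
        rng = random.Random(seed)
        return rng.sample(rows, min(n, len(rows)))
    if rule == "first_n":
        return rows[:n]
    if rule == "last_n":
        # Guard n == 0, because rows[-0:] would return every row.
        return rows[-n:] if n > 0 else []
    if rule == "skip_first_n":
        return rows[n:]
    if rule == "one_every_n":
        # First row of each consecutive block of N rows.
        return rows[::n]
    if rule == "first_n_percent":
        # Assumption: the row count is rounded down.
        return rows[: math.floor(len(rows) * n / 100)]
    raise ValueError(f"unknown rule: {rule}")


rows = list(range(1, 11))  # ten placeholder rows numbered 1..10
print(sample_rows(rows, "first_n", 3))       # [1, 2, 3]
print(sample_rows(rows, "one_every_n", 4))   # [1, 5, 9]
```

Passing a `seed` is only a convenience for reproducing a random sample in this sketch; the tool itself re-randomizes on every run.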

You can also specify a Group By, which applies the selected rule separately to each group of rows.
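A minimal sketch of how a rule might be applied per group, assuming rows are dictionaries and the rule is any function that maps a list of rows to a list of rows. The helper name `sample_by_group`, the column names, and the data are all hypothetical, and the output order (grouped, in order of first appearance) is an assumption about behavior the page does not describe.

```python
from collections import defaultdict


def sample_by_group(rows, key, rule):
    """Apply `rule` to the rows of each group and concatenate the results."""
    groups = defaultdict(list)
    for row in rows:
        # Groups are kept in order of first appearance.
        groups[row[key]].append(row)
    result = []
    for group_rows in groups.values():
        result.extend(rule(group_rows))
    return result


# Made-up data for illustration only.
players = [
    {"team": "A", "name": "p1"},
    {"team": "A", "name": "p2"},
    {"team": "B", "name": "p3"},
    {"team": "A", "name": "p4"},
    {"team": "B", "name": "p5"},
]

# "First N rows" with N = 2, applied to each team separately.
first_two = sample_by_group(players, "team", lambda group: group[:2])
print([p["name"] for p in first_two])  # ['p1', 'p2', 'p3', 'p5']
```

Each group contributes up to N rows, so the output can hold up to N times the number of groups.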

Parameters

Example

Suppose we have a list of baseball players and want to work with a sample of it:

We only want to see the first 5 players in the list, so we use First N rows and set N = 5. We get the following result:

Example 2

Now we want to use only the first 20% of emails in the list, so we use First N% rows and set N = 20. We get the following result:

Example 3

You can also set the N value via a parameter.

Now we want to use only the first 20% of emails in the list, this time using a parameter. We first create a parameter called Sample. We then use First N% rows and set N = {param!Sample}. We get the following result:
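As an aside, a reference such as {param!Sample} can be thought of as a placeholder that is replaced by the parameter's value before the rule runs. The sketch below illustrates that idea only; the `resolve` function and the regular expression are hypothetical and are not the tool's actual resolution logic.

```python
import re

# Matches placeholders of the form {param!Name}.
PARAM_PATTERN = re.compile(r"\{param!([^}]+)\}")


def resolve(value, params):
    """Replace each {param!Name} placeholder in `value` with params[Name]."""
    def substitute(match):
        name = match.group(1)
        if name not in params:
            raise KeyError(f"undefined parameter: {name}")
        return str(params[name])
    return PARAM_PATTERN.sub(substitute, value)


params = {"Sample": 20}  # hypothetical parameter store
n = int(resolve("{param!Sample}", params))
print(n)  # 20
```

With the parameter set to 20, the rule behaves the same as entering N = 20 directly, and changing the parameter changes the sample without editing the tool.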
