Vexdata.io Documentation
  • Vexdata Overview
  • Installation
    • System Requirements
    • Windows Installation
    • Linux Installation
    • Cloud Installation
      • GCP
      • Azure
      • AWS Cloud Deployment Architecture Diagram
        • Pre-requisites
        • Manual Install on ec2
        • Cloud Formation Scripts (command line)
    • Admin Post Installation Setup
    • User Post Installation Setup
    • Server Improvement (Optional)
  • Getting Started
    • Roles and Permissions
    • Managing Groups and Projects
      • Folders
      • Projects
    • Settings
      • Slack Integration
      • Integrations
      • Rules
      • Properties
      • Integrations
      • Email Template
      • Report Template
      • Create Data Connections
        • Databases/Warehouses
        • Run On
  • Creating and Executing Test Cases
    • Test Case
      • Create Test Cases
        • Define Data Source and Target
          • Configure Table/s from Database
          • Input SQL Data
          • Input Files
            • Excel/CSV
            • XML
            • JSON
            • Parquet/AVRO
          • API Input
          • Table Filter
          • Advance - Input
        • Data Compare
          • ETL Testing/Cell By Cell Compare
            • Data Compare Mapping
            • Data Compare Options
          • Schema Compare
          • Counts Compare
        • Schema Monitoring
        • Data Quality
          • SQL Rules
          • Column Level Rules
          • Duplicates
      • Executing Test Cases
      • Defining Test Case Settings
    • Test Runs
    • Variables
    • Note
    • Label
  • Reports
    • User Reports
    • Admin Report or Manager Reports
  • Current Limitations
Powered by GitBook
On this page
  1. Creating and Executing Test Cases
  2. Test Case
  3. Create Test Cases
  4. Define Data Source and Target
  5. Input Files

JSON

PreviousXMLNextParquet/AVRO

Last updated 5 months ago

JSON files can be used as data input by uploading a file, specifying an _s3 _path (s3a://<file_path>), or providing an HDFS location.

  • Multiline options can be set to false if the JSON structure is in single line. Default value for this is set to True.

  • JSON can be flattened, by checking the _Flatten Data _checkbox, if the JSON structure is in hierarchical format.

  • JSON elements can be selected by specifying element names in the columns section and can be renamed using them as keywords. E.g. if id, first__name and last_name are the only three elements to be selected from a JSON with many distinct elements: isbn as id, author.firstName as first_name and author._lastName as last_name.

Multi-line False sample:

{ "isbn": "123-456-222","lastname": "Doe","firstname": "Jane"}
{"isbn": "123-456-777","lastname": "Smith","firstname": "Jane"}

Multi-line True sample:

[
	{ "isbn": "123-456-222","lastname": "Doe","firstname": "Jane"},
	{"isbn": "123-456-777","lastname": "Smith","firstname": "Jane"}
]

Root level columns can be selected above in a multi-root JSON structure. For selection of nested columns, use the Select Columns shape from the palette.

JSON File