Components Overview

In Data Toolchain for AI, each node represents a functional component of your pipeline. A node does one of three things: it brings data in (source), processes or transforms it, or stores the output. You connect nodes on the workflow canvas to build end-to-end data flows.

Aparavi is actively expanding the list of supported nodes.
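
The three node roles described above can be pictured with a minimal Python sketch. This is an illustration only, not the product's API: the `Node` class, the `run_pipeline` function, and the node names are all hypothetical, and the in-memory `stored` list stands in for a real destination.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Node:
    """A hypothetical pipeline node: a name, a role, and a function to run."""
    name: str
    role: str  # "source", "transform", or "store" (illustrative labels only)
    run: Callable[[list], list]


def run_pipeline(nodes: List[Node]) -> list:
    """Pass data through each node in order, like a linear flow on the canvas."""
    data: list = []
    for node in nodes:
        data = node.run(data)
    return data


stored: list = []  # stands in for a real destination such as a bucket or table

pipeline = [
    # Source: brings raw data into the flow (hard-coded here for illustration).
    Node("read_files", "source", lambda _: ["  Hello ", "WORLD  "]),
    # Transform: trims whitespace and lowercases each item.
    Node("clean_text", "transform", lambda items: [s.strip().lower() for s in items]),
    # Store: appends the items to the destination and passes them through.
    Node("save", "store", lambda items: stored.extend(items) or items),
]

result = run_pipeline(pipeline)
```

In the product, the same flow is built visually by placing and connecting nodes on the canvas rather than by writing code.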

Add and Configure a Component

To add a component, click the + at the bottom right of the screen, which opens the Component Panel.

Once you select a component, you can set its properties by clicking the gear icon. Hovering over the component also lets you copy or delete it.

You can then add another component and link the two together by matching their connection types (for example, Text goes to Text, not to Document).
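
The matching rule above can be sketched as a simple type check. This is a minimal illustration under stated assumptions: the `Port` class, the `can_link` and `link` functions, and the component names are hypothetical and do not reflect how the product implements linking.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Port:
    """A hypothetical connection point on a component, tagged with its data type."""
    component: str
    data_type: str  # e.g. "Text" or "Document"


def can_link(output: Port, input_: Port) -> bool:
    """Allow a link only when both ends carry the same data type."""
    return output.data_type == input_.data_type


def link(output: Port, input_: Port) -> tuple:
    """Return the link as an (output, input) pair, or raise if the types differ."""
    if not can_link(output, input_):
        raise ValueError(
            f"Cannot link {output.component} ({output.data_type}) "
            f"to {input_.component} ({input_.data_type})"
        )
    return (output, input_)


# Hypothetical components used only to exercise the rule.
parser_text_out = Port("Parser", "Text")
llm_text_in = Port("LLM", "Text")
store_document_in = Port("Store", "Document")

ok = can_link(parser_text_out, llm_text_in)         # Text to Text: allowed
bad = can_link(parser_text_out, store_document_in)  # Text to Document: rejected
```

Rejecting mismatched links at connection time keeps a component from receiving data in a shape it cannot process.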

Component Types

Audio Components

Processes audio input. This may include transcription, format conversion, or embedding audio for ML usage.

Database Components

Connects to SQL or NoSQL databases, allowing for queries, lookups, or data enrichment within a pipeline.

Embedding Components

Transforms text or images into vector embeddings, enabling downstream AI models to work with structured representations of unstructured content.

Example – Embedding – Image.

Image Components

Handles image-specific tasks like resizing, format conversion, or preprocessing before embedding or analysis.

Infrastructure Components

Nodes for orchestration, automation, or triggering, potentially linked to webhooks, schedules, or external systems.

LLM Components

Large Language Models used to generate, summarize, classify, or transform text based on context or input from previous nodes.

Preprocessor Components

Preprocessor components prepare data for further processing by cleaning, formatting, and transforming it.

Source Components

Connects to external systems and data repositories to bring raw or unstructured data into the workflow.

Examples – Google Drive, AWS S3, Outlook, SMB/NAS, Local File System.

Store Components

Used to write and persist processed output to a destination, such as cloud buckets, local drives, or database tables.

Text Components

Text manipulation utilities such as extraction, formatting, cleansing, or splitting operations within the workflow.

Other Components

Utility nodes that help manage data structures, formats, and flow, such as joins, filters, or conditional logic.