AI Dataset Formatter

Convert CSV, TSV, or JSON data to JSONL format for LLM fine-tuning with role mapping.

Transform structured data into JSONL fine-tuning format. Paste CSV, TSV, or JSON data and the tool auto-detects the format and columns. Map columns to system, user, and assistant roles, then export as JSONL with standard messages format. Supports downloading the output file.

Paste data above to begin formatting

Related Tools

Learn More

FAQ

What JSONL format does this tool output?
Each line is a JSON object with a "messages" array containing objects with "role" (system/user/assistant) and "content" fields. This matches OpenAI's fine-tuning format and is compatible with most LLM providers.
What if my data does not have a system column?
You can leave the system column unassigned — the tool will simply omit the system message from those rows. You can also assign the same column to multiple roles if needed.
How large a dataset can I process?
The tool processes data client-side in the browser. It handles thousands of rows comfortably. For very large datasets (100K+ rows), consider splitting the file and processing it in batches.