Transform your source code into fine-tuning datasets for Falcon 40B and other LLMs
Drag & drop your source files here
Supported: Python, C/C++, Rust, Go, JavaScript, Java, PHP, Ruby, TypeScript
Your processed dataset will appear here
Automatically detects the language of each file and processes code in Python, C/C++, Rust, Go, JavaScript, Java, PHP, Ruby, and TypeScript.
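How the tool detects languages is not described here. As a rough illustration only, the sketch below assumes detection by file extension; the `EXTENSION_TO_LANGUAGE` table and the `detect_language` helper are hypothetical names, not part of the tool.

```python
from pathlib import Path
from typing import Optional

# Hypothetical extension table covering the supported languages listed above.
# The tool's real detection logic may differ (for example, content-based detection).
EXTENSION_TO_LANGUAGE = {
    ".py": "Python",
    ".c": "C",
    ".h": "C",
    ".cpp": "C++",
    ".cc": "C++",
    ".hpp": "C++",
    ".rs": "Rust",
    ".go": "Go",
    ".js": "JavaScript",
    ".java": "Java",
    ".php": "PHP",
    ".rb": "Ruby",
    ".ts": "TypeScript",
}


def detect_language(filename: str) -> Optional[str]:
    """Return the language for a filename, or None if the extension is unknown."""
    suffix = Path(filename).suffix.lower()  # case-insensitive match on the extension
    return EXTENSION_TO_LANGUAGE.get(suffix)
```

Files with an unrecognized or missing extension return `None`, so a caller can skip them rather than guess.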
Automatically generates meaningful instruction-output pairs from your source code.
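One plausible way to derive such pairs, shown here for Python sources only, is to turn each documented function into an instruction (from its docstring) and an output (its source text). This is a minimal sketch under that assumption; `build_pairs` and the instruction wording are hypothetical and do not describe the tool's actual method.

```python
import ast
from typing import Dict, List


def build_pairs(source: str) -> List[Dict[str, str]]:
    """Build instruction-output pairs from the documented functions in `source`."""
    tree = ast.parse(source)
    pairs = []
    for node in ast.walk(tree):
        if not isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            continue
        docstring = ast.get_docstring(node)
        if not docstring:
            continue  # without a docstring there is nothing to phrase an instruction from
        summary = docstring.splitlines()[0]
        code = ast.get_source_segment(source, node)  # exact source text of the function
        pairs.append(
            {
                "instruction": f"Write a Python function `{node.name}`. {summary}",
                "output": code,
            }
        )
    return pairs


SAMPLE = '''def add(a, b):
    """Return the sum of a and b."""
    return a + b


def undocumented():
    pass
'''

pairs = build_pairs(SAMPLE)
```

Functions without a docstring are skipped in this sketch; a real pipeline would need some other way to describe them.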
Tokenization and formatting specifically optimized for Falcon 40B model fine-tuning.
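The exact prompt format and tokenizer settings are not given here. As an illustration of the formatting step only, the sketch below writes each pair as one JSON Lines record with a single `text` field; the `PROMPT_TEMPLATE` layout and the `to_jsonl` helper are assumptions, and tokenization itself is left out.

```python
import json
from typing import Dict, Iterable

# Assumed instruction/response layout; the template the tool actually uses is not documented here.
PROMPT_TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n{output}"


def to_jsonl(pairs: Iterable[Dict[str, str]]) -> str:
    """Serialize instruction-output pairs as JSON Lines, one training record per line."""
    lines = []
    for pair in pairs:
        text = PROMPT_TEMPLATE.format(
            instruction=pair["instruction"], output=pair["output"]
        )
        # json.dumps escapes embedded newlines, so each record stays on a single line.
        lines.append(json.dumps({"text": text}, ensure_ascii=False))
    return "\n".join(lines)


example = to_jsonl(
    [{"instruction": "Add two numbers.", "output": "def add(a, b):\n    return a + b"}]
)
```

A single-field record like this is a common input shape for causal language model fine-tuning scripts, which then apply the model's own tokenizer.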