Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
This project automates detection and fixing of Talend job design issues using YAML-based rules, .item job files, and Open Router LLMs (GPT-based). It supports zipped jobs, generates CSV reports, and shows beautifully formatted CLI output.
It Include 10 Production Ready jobs already. Just plug and play. Understand then Reuse.
This project is an AI adaption of my previous one - Qlik Community
Link to Github Repo - Github
Connect with me for suggestions/feedback - LinkedIn
https://www.youtube.com/watch?v=H0oVfQjiWkk
See the full agentic Talend LLM quality fixer in action!
This framework supports 50+ scalable, production-level rule checks tailored for enterprise Talend pipelines.
These cover:
All rules are YAML-defined and support both auto fix and LLM-assisted suggestions.
zipped_jobs/
python talend_AgenticAI_SouraV1.py --verbose
This will:
Visit https://openrouter.ai/
Sign in and generate an API key from:
https://openrouter.ai/keys
Copy the API key (starts with sk-or-...)
Create a .env file in your project root with:
OPENROUTER_API_KEY=sk-or-your-key-here
pip install python-dotenv
This allows llm.py to securely load your OpenRouter key at runtime.
├── talend_AgenticAI_SouraV1.py # Main orchestrator script ├── extract_items_with_delay.py # Extracts `.item` files from zipped_jobs/ ├── validate_syntax.py # Checks YAML rule format ├── src/ │ ├── main.py # Linting and fixing engine │ └── llm.py # LLM handler via OpenRouter API ├── rules/ # YAML rules for code quality ├── zipped_jobs/ # Input zipped Talend exports ├── jobs/ # Extracted .item files ├── fixed/ # Auto-fixed jobs └── reports/ └── fix_summary_report.csv # CSV summary
✅ Extracted job: myJob_0.1.item
job_file rule_id status llm_suggestion confidence myJob.item RULE_036 llm-suggested Defaults in schema don't match data type medium myJob2.item RULE_028 fixed 1.0