None

Promtec Innovation • mérida, yucatán • Posted June 27, 2026
About the Role

We're building something that doesn't exist yet in 
Latin America: a domain-specific large language model 
trained on a large Spanish-language corpus, deployed 
on proprietary on-premise hardware, solving a real 
problem for clients who are already waiting. 

We can't tell you exactly what it is yet. 
What we can tell you: 

→ The corpus is real and large 
→ The clients are real and paying 
→ The hardware is ready 
→ The team is small and the decisions matter 
→ The person who joins now shapes the architecture 

This is a fully on-site role in Mérida, Yucatán. 
We want someone in the room — not because we don't 
trust remote work, but because the knowledge needs 
to live in the team, not in one person's laptop. 

────────────────────────────── 
  WHAT YOU'LL BUILD 
────────────────────────────── 
→ Large-scale Spanish text dataset pipeline: 
 cleaning, deduplication, tokenization 
→ Continual Pre-Training (CPT) on open-source 
 base models (Llama/Qwen family) on dedicated 
 GPU workstation — in our office 
→ Supervised fine-tuning: SFT with LoRA/QLoRA, 
 HuggingFace + TRL 
→ RLHF/DPO pipeline with domain expert annotators 
→ Model quantization for on-premise deployment: 
 GGUF, MLX, llama.cpp 
→ RAG system on PostgreSQL + pgvector 
→ Evaluation suite + hallucination monitoring 

────────────────────────────── 
✅ YOU NEED THIS 
────────────────────────────── 
→ Python — advanced and demonstrable 
→ Real ML framework experience: PyTorch, 
 TensorFlow, or Scikit-learn with actual 
 projects, not just certifications 
→ Mathematical foundation: linear algebra, 
 stats, calculus — things you actually use 
→ Linux CLI — the training workstation runs 
 Linux, full stop 
→ English — reading ML papers and HuggingFace 
 docs is part of the daily job 
→ Spanish native or C2 — the corpus is in 
 Spanish 
→ Based in Mérida, Yucatán — fully on-site, no exceptions 

────────────────────────────── 
⭐ BONUS POINTS 
────────────────────────────── 
→ HuggingFace Transformers / TRL / PEFT 
→ LLM fine-tuning: SFT, LoRA, QLoRA, DPO 
→ RAG pipelines and vector databases (pgvector) 
→ Ollama, llama.cpp, MLX (Apple Silicon) 
→ Data engineering: ETL, scraping, text pipelines 
→ Docker + CI/CD basics 

────────────────────────────── 
  WHAT YOU GET 
────────────────────────────── 
  Competitive salary — first call, directly, 
  no invented ranges 
  On-site in Mérida — real team, real 
  collaboration, real knowledge transfer 
 ️ Dedicated GPU hardware — not your laptop