Procogiavia Greenhouse

LLM Research Intern

Vancouver, BCPosted 1d ago

NLP / LLMEntry LevelFull-time

Not sure if you're a good fit?

Upload your resume and TixelJobs AI will compare it against LLM Research Intern at Procogia. Get a match score, missing keywords, and improvement tips before you apply.

Free preview · Your resume stays private

About the Role

About ProCogia:

We help businesses transform data into real growth!

Our clients operate in high-stakes, highly regulated industries (such as telecom, financial services, life sciences, and more), where precision, compliance, and measurable outcomes are non-negotiable. We partner with them by embedding expert data science, engineering, and AI talent directly into projects that matter.

We’re a diverse, close-knit team with a shared goal: delivering top-class, end-to-end data solutions. We don’t just analyse data, we push the boundaries of what’s possible, helping clients unlock new value and insights.

When you join ProCogia, you’ll find a supportive, growth-driven environment where your ideas are welcomed, and your development is prioritized. We offer competitive salaries, generous benefits and perks for personal and professional development.

If you’re ready to unleash your potential and work at the cutting edge of data consulting, we’d love to meet you!

The core of our culture is maintaining a high level of cultural equality throughout the company. Our diversity and differences allow us to create innovative and effective data solutions for our clients.

Our Core Values: Trust, Growth, Innovation, Excellence, and Ownership

Responsibilities

Assess client-specific data assets and determine the appropriate adaptation strategy — continued pretraining, supervised fine-tuning, or a combination — based on the domain, data volume, and use case requirements

Curate, clean, structure, and prepare domain-specific datasets from raw client data for use in model training pipelines

Fine-tune large language models in the 70B–100B+ parameter range using techniques such as LoRA, QLoRA, and multi-adapter patterns

Perform continued pretraining on open-weight models (Qwen, Llama, and related ecosystems) to embed domain knowledge directly into model weights

Manage distributed training workflows across multi-node GPU clusters

Design and execute evaluation frameworks to validate domain adaptation quality, factual grounding, and model behavior

Support RAG system development where applicable, including vector database integration, chunking strategies, and reranking pipelines

Contribute to inference optimization and deployment pipeline integration

Required Qualifications

Ready to apply?

This job is active. Apply now to get in early.

Similar Jobs

LLM Engineer

Algoscale

Generative AI Engineer

Jobix.ai

Generative AI Engineer

Clevanoo LLC

Generative AI Engineer - Remote

North Star Staffing

View all jobs