H
Hyphenconnectvia Greenhouse
Multimodal AI Systems Architect (AI Engineering)
AustraliaPosted 2w ago
ML EngineerLeadFull-time
Not sure if you're a good fit?
Upload your resume and TixelJobs AI will compare it against Multimodal AI Systems Architect (AI Engineering) at Hyphenconnect. Get a match score, missing keywords, and improvement tips before you apply.
Free preview · Your resume stays private
About the Role
We are seeking a talented Multimodal AI Systems Architect to develop and optimize AI systems that seamlessly integrate vision and audio models. This role focuses on enhancing our voice-to-voice interactions and multimodal retrieval capabilities, ensuring our systems are efficient and innovative.
Responsibilities:
- Integrate vision encoders and audio-native models into core agent reasoning loops.
- Optimize streaming latency for voice-to-voice AI interactions.
- Architect multimodal RAG systems capable of retrieving insights from videos and PDFs.
Qualifications:
- Experience with Whisper, CLIP, and multimodal LLM integration.
- Knowledge of streaming architectures and WebRTC.
- Expertise in cross-modal alignment.
Ready to apply?
This job is active. Apply now to get in early.