Research Engineer・Research Scientist (Creative Vision)

Salary: 650 - 1800 百万円

PyTorch
SB Intuitions

Job Description

職務内容 (Responsibilities)

  • Training large-scale image, video, and multimodal generation models, ensuring advanced performance and scalability.
  • Translating cutting-edge research into practical solutions by investigating downstream applications that enhance the model’s business value, and delivering actionable insights to the product team.
  • Presenting research findings at premier international conferences, building connections with global research communities.
  • 大規模な画像・動画・マルチモーダル生成モデルの開発。
  • 市場の動向を調査し、プロダクトチームと連携し研究成果を実用的なプロダクトに繋げる。
  • トップカンファレンスでの研究成果の発表、グローバルな研究者コミュニティとのコネクション構築。

応募資格(必須) (Required Qualifications)

  • (Research Scientist) First-author publications on visual or multimodal generation (or related topics), in top conferences and journals (e.g., ICCV, CVPR, NeurIPS, ICML, ICLR, ECCV, ACL, TPAMI, IJCV, or equivalent).
  • (Research Engineer) Hands-on experience in large-scale generative model training or fine-tuning.
  • Master’s or Ph.D. in Computer Science, Data Science, or a related field, with a strong foundational understanding.
  • Proven research (for scientist) or industrial (for engineer) experience in visual or multimodal generation.
  • Willingness to work collaboratively within a team.
  • Proficiency in PyTorch or similar deep learning frameworks.
  • Solid understanding and experience with recent visual generative models (diffusion, flow matching, AR).
  • Self-motivated with a passion for continuous learning and staying current with new technologies.
  • Willingness to pursue industrial research aimed at practical real-world solutions, not just academic publication.

応募資格(歓迎) (Preferred Qualifications)

  • (Research Engineer) Experience with large-scale distributed training (FSDP/DeepSpeed/Megatron-LM), low precision training, and model parallelism.
  • Contributions to open-source AI projects or significant GitHub projects related to visual/multimodal generation.
  • Knowledge of machine learning techniques for data collection, cleaning, and pre-processing.
  • Experience handling non-visual modalities, such as audio, speech, or language.

ミッション (Mission)

  • Research Scientist: Design, implement, and evaluate novel algorithms, models, and techniques for visual and multimodal generation.
  • Research Engineer: Focus on implementation and optimization, translating research concepts into practical multimodal AI model development, and building/maintaining robust data pipelines.

求める人物像 (Ideal Candidate Profile)

  • Alignment with the mission and willingness to take on new challenges.

仕事の魅力 (Benefits of the Role)

  • Opportunity to participate in large multimodal generation model training and development projects.
  • Apply research to real-world applications with measurable business impact.
  • Work with a diverse, international team based in Tokyo.
  • Access to the largest computation resources in Japan.
  • Competitive compensation package.

雇用形態 (Employment Type)

  • Full-time employee (正社員)

備考 (Notes)

  • Interviews are primarily conducted online.
  • Specific interview details will be provided individually.

勤務時間 (Working Hours)

  • Flexible working hours available.
  • Standard hours: 9:00 AM – 5:45 PM (1 hour break).
  • Overtime: Possible.

賃金・賞与 (Salary & Bonus)

  • Monthly salary: ¥541,667 – ¥1,500,000
  • Estimated annual salary: ¥6,500,000 – ¥18,000,000 (no strict upper limit)
  • Salary includes base + fixed overtime (35 hours).
  • Additional incentives possible.
  • Overtime pay applicable beyond fixed hours (for non-managerial positions).

諸手当 (Allowances)

  • Commuting allowance (up to ¥150,000/month)
  • Overtime, late-night, holiday work allowances (general roles)
  • Management allowance, late-night, commute allowance (manager roles)

休日・休暇 (Holidays/Vacation)

  • Full two-day weekends (Sat/Sun), national holidays
  • Year-end/New Year holiday (Dec 29–Jan 3)
  • Paid annual leave (6–21 days, depending on hire month)

福利厚生 (Benefits)

  • Health, employment, workers’ accident insurance, employee pension
  • Additional benefit programs (Benefit One, premium services)
  • Savings & welfare insurance, group insurance schemes
  • Defined contribution pension (401K)

出社頻度 (Work Location/Frequency)

  • Hybrid: Best mix of remote and on-site work (no fixed requirement)
    • Average: 1–2 days per week on-site
    • Remote is possible for those residing far from the office (pre-approval required)
    • Travel costs for on-site work are covered (with limits)