Data Engineer (Databricks)

ERNI

面议
现场办公 - 曼陀罗勇应届毕业生/学生专科全职
分享

职位描述

职位描述

Founded in 1994 and headquartered in Switzerland, ERNI is a leading Software Development company with over 800 employees worldwide. Specializing in IT and software engineering, we drive innovation in process and technology. Our first service center in Asia Pacific, located in Metro Manila (Mandaluyong), supports clients across Europe, APAC, the Philippines, and the USA. As we continue to grow, we're looking for passionate and motivated individuals to join our team.

Why ERNI is the Perfect Place for You: ?

  • International Exposure: Work with global clients on cutting-edge projects.
  • Inclusive Culture: Thrive in a collaborative and diverse work environment.
  • Career Development: Enjoy continuous learning and professional growth opportunities.

?Perks And Benefits

  • Career Stability: Enjoy a stable career path with ample project opportunities.
  • Immediate Coverage: Private HMO and insurance benefits from day one.
  • Jubilee Celebration: A 5-year milestone includes a complimentary trip to any European ERNI sites.
  • Comprehensive Benefits: Government-mandated benefits including 13th-month pay.
  • Skill Enhancement: Access free training and certifications.
  • Wedding Gift: To celebrate your special day.
  • Baby Basket: To welcome your newborn to the ERNI family.
  • Fruit Basket: Boost of vitamins during hospitalization.
  • Office Perks: Enjoy free snacks and coffee.

?Growth And Opportunities

  • Free Training: Advance your skills through technical and non-technical training.
  • Challenging Projects: Engage in complex software projects across MedTech, Industry,

Finance, and Transportation.

  • Supportive Environment: Benefit from a team dedicated to guiding and supporting your success.
  • Recognition and Advancement: Receive acknowledgment for your efforts and

opportunities for promotion.

  • Open Communication: Experience transparency and value your input in our culture.

⏱Flexibility

  • Hybrid Work Setup: Balance remote and in-person work for better work-life integration.

?Events

  • Connect and Celebrate: Participate in a variety of events including leisure, summer,

family, social, and year-end gatherings.

Experience

?What are our wishes:

  • 7+ years of experience in data engineering roles, with at least 2 years in a leadership role and projects involving Databricks and AWS/Azure.
  • Proven expertise in data pipelines, feature engineering, and dataset preparation for machine learning, specifically LLMs.
  • Experience building enterprise-grade applications with GenAI or AI/ML integrations.

Technical Skills

  • Expertise in Databricks, Apache Spark, and Delta Lake.
  • Strong programming skills in Python and SQL; knowledge of libraries like pandas, NumPy, or PyTorch is a plus
  • Understanding of state management libraries like Redux, Recoil, or Zustand.Cypress), and version control (Git).
  • Understanding of web security principles and compliance requirements for enterprise applications.

Soft Skills

  • Exceptional problem-solving and decision-making abilities.
  • Excellent communication and leadership skills, with the ability to guide technical discussions and mentor team members.
  • Strong focus on delivering quality

?How can you contribute to the team?

The Senior Data Engineer will specialize in building and optimizing data pipelines with Databricks and preparing datasets for Large Language Models (LLMs). This role will focus on designing scalable, efficient data architectures to support cutting-edge machine learning initiatives, particularly in generative AI applications.

  • Data Pipeline Development:
  • Design, implement, and optimize end-to-end data pipelines using Databricks, AWS, Azure, and related technologies.
  • Build workflows to handle large-scale data ingestion, transformation, and storage.
  • Data Preparation for LLMs:
  • Preprocess, clean, and structure diverse datasets (text, structured, and unstructured) for LLM training and fine-tuning.
  • Implement feature engineering, tokenization, and vectorization techniques to support NLP models.
  • Performance Optimization:
  • Use Databricks features, including Delta Lake and MLflow, to streamline data workflows.
  • Optimize data infrastructure for high availability, scalability, and cost-efficiency.
  • Collaboration with Teams:
  • Work closely with data scientists, ML engineers, and other stakeholders to understand data requirements for LLM technology requirements.
  • Ensure alignment between engineering pipelines and machine learning goals.
  • Data Quality & Governance:
  • Implement processes to ensure data quality, consistency, and compliance with governance policies.
  • Monitor and maintain data integrity throughout the pipeline lifecycle.
  • Emerging Technology Adoption:
  • Stay updated on advancements in Databricks, generative AI, and LLM technologies.
  • Contribute to the adoption of innovative tools and practices to improve workflows.

Switzerland

  • Germany
  • Spain
  • Slovakia
  • Romania
  • Philippines
  • Singapore
  • USA

ERNI Development Center Philippines Inc., 9th Floor, Lica Malls Shaw, 500 Shaw Boulevard, 1555, Mandaluyong City, Philippines

+63 5310 1707 | www.betterask.erni | [email protected]

We deliberately focus on what we know best.

  • 18 Locations in 8 Countries
  • 800+ Employees across the Globe
  • ISO Certified

职位要求

Please refer to job description.

数据建模ETL ProcessesSQLPythonData WarehousingBig Data Technologies云计算Data Pipeline AutomationNoSQLData Quality Assurance
Preview

Boss

HR ManagerERNI

工作地址

500 Shaw Blvd, Mandaluyong, National Capital Region, PH

发布于 26 April 2025

ERNI

501-1000人

其他

查看热招工作

举报

Bossjob安全提醒

如果该职位要求您在海外工作,请保持警惕,谨防欺诈。

如果你在求职过程中遇到有以下行为的雇主, 请立即举报

  • 扣留您的身份证,
  • 要求您提供担保或收取财产,
  • 迫使你投资或筹集资金,
  • 收集非法利益,
  • 或其他非法情况。