Logical is proud to be partnering with a long-standing customer. Our client specialises in Data Infrastructure & Analytics and continues to grow rapidly. We are seeking candidates at all levels who specialise in Scala and Apache Spark.
This role can be based in Melbourne (4 days per week in the office) or fully remote from anywhere in Australia.
Key Responsibilities:
• Software Development: Design, develop, and optimise distributed applications using Java and Scala to support large-scale data processing and analytics.
• Apache Spark Expertise: Utilise deep knowledge of Apache Spark internals to build and optimise efficient data pipelines, ensuring performance and scalability.
• Table Formats: Work with open-source table formats like Apache Iceberg, Delta Lake, Apache Hudi, and Apache XTable to manage and process large data sets effectively.
• Lakehouse Architecture: Implement open Lakehouse solutions, including cataloguing systems like Unity Catalog and Polaris Catalog, and manage ML workflows with MLflow.
• Computer Science Fundamentals: Apply a solid understanding of data structures, caching mechanisms, networking, and database management to build efficient systems.
________________________________________
Key Requirements:
• Experience: Mid- to senior-level experience (3+ years) in Java or Scala development.
• Computer Science Knowledge: Strong understanding of core concepts such as data structures, caching strategies, networking, and database systems.
• Apache Spark: In-depth knowledge of Apache Spark internals, including job execution, query optimisation, and distributed data processing.
• Open-Source Table Formats: Hands-on experience with Apache Iceberg, Delta Lake, Apache Hudi, or Apache XTable.
• Open Lakehouse & MLflow: Familiarity with open Lakehouse architecture tools such as Unity Catalog, Polaris Catalog, and MLflow for data management and machine learning integration.
________________________________________
Preferred Qualifications:
• Experience with cloud platforms (AWS, GCP, Azure) and distributed systems.
• Familiarity with CI/CD pipelines and version control (Git, GitHub).
• Understanding of machine learning workflows and their integration into data pipelines.
If this role suits your skill set, please apply via the link below. For more details, please email frankie@logicalresources.com
443 Little Collins Street
Melbourne, VIC 3000
423 Kirkstall Road
Kent, Leeds, LS12 7RS