Senior Data Engineer, (Synthetic Environment Composition)



We are spinning up a new team, ‘Synthetic Environment (SE) Composition’ which will develop and deliver a standardised Asset Information Architecture, and be responsible for creating a set of applications and services to better manage, maintain and analyse our ecosystem of content. This team will support modelling scientists and architects by developing tools for design, verification and validation of models within Synthetic Environments.

This is a brand new area, taking all of the learnings and developments we have made in our customer projects, and shaping them into a new product offering. We are seeking a Senior Data Engineer to help develop a content classification and metadata catalogue for models and datasets created internally or by third-parties and create a knowledge base service, which will enable effective and efficient querying of the content catalogue by SE architects and model engineers.

Area of impact

    • To help develop a metadata cataloguing framework to be used by our existing and future projects; and help categorise and catalogue our wide range of assets
    • To help develop a Knowledge Base service that queries a range of asset stores and projects, and provides asset information such as content, usage, dependencies, ownership, access level, history, behaviours, etc based on an expanding ontology
    • To help design and build services needed for accessing, querying and updating assets, as needed by synthetic environment users and modelling engineers
    • To collaborate with the team to define, estimate, plan and deliver tasks, according to the product roadmap
    • Create engineering solutions that are scalable and efficient
    • Work closely with modelling engineers and applied scientists to better understand user stories and use cases and apply this knowledge to the design process
    • Support the product owner and the technical delivery manager with requirements capture, planning, backlog refinement, and stakeholder collaboration
    • Collaborate with other product development and project teams across the defence business

We would like to hear from you if you identify with the following

    • Proven experience in developing commercial software in multiple domains
    • Previous experience of working as part of an Agile product development team
    • Proven expertise with at least one object oriented programming language (C++, Python, Java, etc)
    • Familiarity with data representation and memory architecture
    • Experience building data models, with exposure to at least two or more of the following: Graph, Relational, Document-oriented databases.
    • Hands-on experience with data warehousing, data lakes and/or data lakehouse architectures
    • Exposure to distributed data processing systems (e.g. Apache Spark)
    • Importantly, you are considerate, humble, and a strong believer in teamwork.
While we think the above experience could be important, we’re keen to hear from people that believe they have valuable experience to bring to the role. If you identify with the team and mission, but not all of our requirements, then please still apply. 

About Us
Improbable is determined to foster an environment where people can do their best work and feel like they belong. We believe a healthy culture, strong values and contribution from a diverse range of individuals will help us to achieve success.
We do not discriminate based on race, ethnicity, gender, ancestry, national origin, religion, sex, sexual orientation, gender identity, age disability, veteran status, genetic information, marital status or any other legally protected status.
Life at Improbable
Diversity, inclusion & belonging
Apply for this job

Location: London

Date posted: 2022-01-20