
Python Data Engineer/Data Scientist
Caspian Oneâ˘WrocĹawâ˘Hybrydowa
đ Opis gĹĂłwny / WstÄp
Senior Python Data Engineer / Data Scientist
Location: Flexible (Hybrid in Wroclaw)
Department: FrontâOffice Technology & Analytics
Type: Fullâtime
About the Role
Weâre working with a leading global investment organisation that is expanding its frontâoffice engineering function. This role sits at the intersection of data engineering, applied data science, and frontâoffice portfolio support, with a focus on building highâquality Python systems that directly enable portfolio managers, analysts, and risk teams.
Youâll work across several investment teams, helping to rewrite key components, productionise POCs, optimise data access patterns, and deliver reliable, scalable pipelines that power core analytics and proprietary models.
If you enjoy solving complex data problems, building production systems endâtoâend, and working closely with frontâoffice stakeholders, this is a highâimpact opportunity.
What Youâll Work On
Design, build, and operate Python-based data pipelines (batch & nearârealâtime) reading/writing to governed enterprise storage (e.g., parquet patterns via internal cloud storage).
Develop productionâready ML/data components including feature prep, scoring pipelines, and APIâexposed data services.
Support the reâengineering and migration of legacy workflows into modern Python-based solutions.
Partner directly with frontâoffice portfolio managers and analysts to extend and industrialise proprietary models.
Optimise Databricks connectivity and collaborate with platform teams to improve Spark/PySpark workloads.
Support portfolio teams with data access, connectivity principles, and technical design for productionisation.
Contribute to engineering standards: CI/CD, testing, code reviews, observability, automation, and documentation.
Work on POCs through to full production rollout, including rewriting logic from other languages into Python where required.
What Weâre Looking For
MustâHave Skills
Strong, professional-level Python engineering skills (data structures, packaging, typing).
Experience with ETL or MLâoriented Python workflows.
Experience building robust data pipelines and working with parquet/columnar formats in governed storage.
Exposure to PySpark or distributed data processing.
Comfortable working in platform environments (Databricks, containerised services, internal dev platforms).
Strong software engineering foundations: Git, testing, code review, CI/CD, documentation.
NiceâtoâHave Skills
Experience with geospatial data (Azure Maps) or risk/analyticsâdriven datasets.
Familiarity with FastAPI or Flask for serving models/services.
Exposure to orchestration tools (Airflow, Databricks Jobs).
Understanding of enterprise data governance principles and collaboration models.
Who Thrives Here
Engineers who enjoy direct collaboration with frontâoffice teams and shaping technical direction.
People who like owning solutions endâtoâendâfrom requirements gathering through production deployment.
Those who value clean, maintainable engineering practices in a fastâpaced, data-driven environment.
Individuals comfortable working across multiple teams and adapting quickly to new problem spaces.