Apply for this job now

Python Developer with Containers (GenAI)

Location
New York
Posted
23 Jan 2026
Python Developer with Containers (GenAI)

Duration: 12-18 months

W2 Candidates only

Bryant Park New York New York 10038

(we have 2 roles 1 - Mid/Sr level and other is Lead)

Key Responsibilities:
o Collaborating with team and with partners in QSDG and Platform to define, build, test, and deploy platform meeting requirements.
o Defining & enforcing standards & best practices related to platform management.
o Evaluate third-party products to meet scalability, resiliency, and performance.
o Build new or leverage existing platforms (Lab, SDLC) by automating setup, installation, verification, monitoring & provisioning processes.
o Maintaining a central, version controlled, inventory of all environments, including their current versions and configuration settings.
o Planning & allocating environments to teams depending on their delivery lifecycle.
o Analyzing data to identify and proactively address environment-related issues.
o Working with project teams to manage costs & improve efficiency of environments.
o Partnering closely with Prod Support and Engineering to deploy & support applications.

Required Skills:
o Prior experience designing, implementing, and maintaining end to end environments, from POC to production.
o Deep understanding of hardware, software, network, data & application configuration.
o DevOps processes and CICD tooling (Jira, Git/Bitbucket, Jenkins, Datival, Artifactory, Ansible), orchestration & automation.
o Multi-tier (Python based) web application stack microservices/serverless/loosely coupled architecture.
o Mix of on-premises and cloud based, containerized (Docker/Kubernetes/OpenShift) deployment models.
o Familiarity with no-SQL (MongoDB) and relational (SQL Server/Oracle) databases, and other various forms of Object, Vector, and file stores.
o Unix scripting, SQL, work scheduling tools.
o Setting up infrastructure monitoring & reporting for GPU/CPU & memory consumption, inference latency and model performance.
o Performance profiling & optimization techniques to maximize performance & resource consumption / throughput and minimize latency.
o Load balancing, high availability & backup recovery strategies/techniques.

Non-Technical
o Ability to communicate effectively to a wide range of audience (business stakeholders, developer & support teams).
o Meticulous & highly organized.
o Adaptable to shifting & competing priorities.
o Skilled at delegating, mentoring & setting expectations.
o Critical thinking skills to diagnose & resolve complex issues.
Experience desired:
o Familiarity with AI & Deep learning, modeling techniques, Generative AI application stack.
o Proficiency in Python and familiarity with AI frameworks (TensorFlow/PyTorch).
o GPU cluster management (CUDA/Kubernetes), auto-scaling & scheduling (Triton Inference Server).

Qualifications:
o 7-10 years in similar roles. Preferably in the financial industry.
o Higher education in IT field or relevant previous work experience.
Apply for this job now

Details

  • Job Reference: 2584131044-2
  • Date Posted: 23 January 2026
  • Recruiter: Apex Informatics
  • Location: New York
  • Salary: On Application