We convert our clients' core python projects into pyspark for distributed processing. Having a broad range of cool industry projects from simple data engineering, sklearn simple ML, Image/video processing to NLP related projects. Data scientist having DS pipeline and workflow understanding on top of that experience into at least one or more than one area described above. We will provide Apache spark and PySpark training to candidates who are not familiar with this.
Experience
Fresh to 2 years of experience as a data scientist working with ML/DL projects.
Skills
Strong Deep learning and computer vision-related concepts and experience.
Experience with basic python libraries like NumPy, Pandas, Matplotlib, sklearn and pickle etc.
Understanding of complete DS process ETL, data wrangling and modeling, training, validation and deployment etc.
Should be able to understand state of the art deep learning models (CNNs, LSTM, GANS, YOLO etc.) and strong grip on TensorFlow and PyTorch to implement these models.
Experience with image/video processing tasks like object tracking, object detection, segmentation using pre-trained or custom defined models.
Should be able to follow compatibility of different versions of TensorFlow & PyTorch and converting code from one version to another.
Understanding of GPU libraries like CUDA etc. and run deep learning models with GPU.
Good English communication and presentation skills.
Having Apache spark, PySpark, Databricks and AWS SageMaker knowledge and experience would be a plus.
Creating a diverse and inclusive workplace is one of Baltoro’s core values. We are an equal opportunity employer and welcome people of different backgrounds, experiences, abilities and perspectives.