data
Posted Apr 23Data Scientist II - Big Data R&D, Identity Graph & KYC
at Socure
United StatesHybrid
Requirements
- You will work closely with senior data scientists and engineers while developing your skills in large-scale ML, distributed systems, and graph analytics.
- WHAT YOU'LL DO - Contribute to the design and implementation of machine learning, data mining, statistical, and graph-based algorithms to analyze very large datasets for identity verification and anomaly detection.
- - Build and maintain components of data-processing pipelines (ETL, feature generation, normalization) using tools such as Spark/PySpark and AWS (e.g., EMR, S3).
- - Implement and maintain SQL and Python/R code for data extraction, transformation, and validation; contribute to code reviews and basic testing.
- WHAT YOU BRING - Master’s degree with 2+ years of experience, or Ph.D. with 1+ years of
- experience in a data science or analytics role, or equivalent practical experience. - Proficiency in at least one general-purpose programming language used in data science (Python, or Scala). - Solid
- experience writing and optimizing SQL for large datasets; comfort working in data lake / warehouse environments. - Hands‑on
- experience with Spark or PySpark and common ML libraries (e.g., scikit‑learn, XGBoost, TensorFlow/PyTorch a plus). - Familiarity with UNIX environments and the AWS ecosystem (e.g., EMR, S3); Databricks
- experience is a plus. - Working knowledge of supervised/unsupervised ML and basic statistics (similarity measures, clustering, evaluation metrics). - Exposure to graph techniques or graph databases (Neo4j, AWS Neptune, GraphFrames) is a strong plus. - Bonus:
- experience with Elasticsearch or DynamoDB; workflow tools such as Airflow for automating data pipelines. - Ability to break down loosely defined problems, ask good clarifying questions, and iterate quickly with feedback.
Contact
- Follow Us! YouTube https://www.youtube.com/c/Socure | LinkedIn https://www.linkedin.com/company/socure/ | X (Twitter) https://x.com/socureme | Facebook https://www.facebook.com/socure/
Additional details
- WHY SOCURE? Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts.