Opportunity Description
Proofpoint is looking for a Senior Data Scientist to join our Machine Learning team.
Key Responsibilities
- Derive actionable insights from massive volumes of unstructured email data using distributed computing tools (Spark, Iceberg, Athena) and AWS SageMaker.
- Engineer features that capture communication behaviors and patterns across billions of email records, translating domain expertise into scalable feature pipelines.
- Fine‑tune LLMs for domain‑specific tasks (email topic detection, NER) and train neural network models from scratch on massive volumes of email data to identify anomalous patterns and behaviors indicative of malicious threats.
- Apply classical machine learning techniques (e.g., boosting, bagging) to train classifiers for large, highly imbalanced datasets while keeping inference latency low.
- Deploy trained models to production using AWS SageMaker, directly impacting the effectiveness of our core t...