Data Scientist
Herndon, VA OR Columbia, MD, Remote
Clarity Innovations
Clarity Innovations is a trusted national security partner, dedicated to safeguarding our nation’s interests and delivering innovative solutions that empower the Intelligence Community (IC) and Department of Defense (DoD) to transform data into actionable intelligence, ensuring mission success in an evolving world.
Our mission-first software and data engineering platform modernizes data operations, utilizing advanced workflows, CI/CD, and secure DevSecOps practices. We focus on challenges in Information Warfare, Cyber Operations, Operational Security, and Data Structuring, enabling end-to-end solutions that drive operational impact.
We are committed to delivering cutting-edge tools and capabilities that address the most complex national security challenges, empowering our partners to stay ahead of emerging threats and ensuring the success of their critical missions. At Clarity, we are people-focused and set on being a destination employer for top talent, offering an environment where innovation thrives, careers grow, and individuals are valued. Join us as we continue to lead innovation and tackle the most pressing challenges in national security.
Job Summary:
As a data scientist on the data team, you will be responsible for developing and implementing machine learning models and advanced analytics to support data-driven decisions within the company’s target market. You will work on data exploration, feature engineering, and model optimization while collaborating with other data engineers to ensure seamless data processing in the data lake backing the company's SaaS platform. This role involves analyzing large datasets and communicating unique data-derived insights to internal and external stakeholders. Additionally, you may contribute to improving ETL pipelines and ensuring data quality, and you will need to stay current with the latest advancements in data science and AI/ML techniques. You will work closely with our team to identify strategies for deploying advanced data products in line with the product roadmap and business vision.
Data Analysis and Exploration:
- Conduct thorough data exploration to understand the structure, content, and quality of data within the data lake.
- Explore novel data sets and apply statistical methods to analyze data, identify patterns, trends, and correlations to derive actionable insights.
- Prepare and clean data for analysis, including handling missing data and outliers and ensuring data consistency across multiple sources.
- Conduct feature engineering to develop and test features for machine learning models and ensure they contribute effectively to desired outcomes based on the purpose of the model.
- Develop new AI/ML models to address mission problems in alignment with the company’s vision, target market segments, and product roadmap.
- Lead the research, design, and deployment of AI/ML models for various workflows including predictive analytics, data classification, and anomaly detection.
- Select appropriate algorithms based on the problem at hand considering factors such as accuracy, interpretability and computational efficiency / resource limitations.
- Design and conduct experiments to test hypotheses, evaluate models, and refine algorithms, using advanced techniques such as A/B testing.
Advanced Analytics and Data Products
- Develop advanced analytic algorithms to derive actionable insights and create new data-driven products.
- Drive the creation of new data products from data science research, producing advanced analytics built on geospatial insights, statistical analysis, customer behavior analysis, trend discovery, and forecasting.
- Work with the application engineering team to ensure data products are integrated with existing access models to promote maximum customer adoption of new data products.
Data Ingestion and Governance
- Contribute to the development and maintenance of ETL pipelines for ingesting and processing large volumes of structured and unstructured data from various sources into the data lake.
- Work closely with other data engineers and the senior systems architect to optimize storage schemas, ensuring choice of data types and structures that support efficient storage, retrieval, and processing of data.
- Evaluate and recommend new tools, technologies, and methodologies to improve data science processes and outcomes.
- Identify and implement optimizations in data ingestion and processing pipelines to improve performance, reduce costs, and enhance data pipeline scalability/stability.
Data Visualization & Presentation:
- Create clear and compelling visualizations to effectively communicate data-derived insights.
- Choose appropriate types of charts, graphs, and other visual tools based on the data product and receiving audience.
- Recommend and utilize appropriate data visualization software (Tableau, Power BI, Quicksight, Plotly, Grafana, etc.).
- Prepare and deliver presentations that convey data insights and recommendations to both technical and non-technical audiences.
Client Engagement and Data Product Value Expertise
- Demonstrate effective communication of data product fit and value to current and potential clients. Clearly articulate the technical aspects and value proposition of the company’s data, analytics, and other data products to clients.
- Collaborate with the sales team to tailor data product solutions to client-specific needs.
- Deliver data product demos and presentations, and conduct technical discussions with potential clients. Address client queries, concerns, and technical questions related to our data solutions.
- Assist in implementation, deployment, and integration of data products with client systems to ensure maximum uptake and value realization.
- Collect and document client feedback to inform future product development direction and improve the organization's overall data product offerings.
Tags: Analytics Business Intelligence CI/CD DevSecOps DoD Governance Grafana Machine Learning SaaS