Data Scientist Intern
Job description:
The candidate must be reside in a HUBZONE location. The ideal candidate must be reside in and around the university and his/her address must be found in a HUBZONE location that can be verified with the following map
https://maps.certify.sba.gov/hubzone/map#center=44.722800,-103.249700&zoom=4
Please do not apply if your residential address does not exist within the designated HUBZONE locations as per the map.
The Brite Group Inc. is a Small Disadvantaged Business (SDB) with SBA (8A) certification, serving federal, state and local agencies in the Washington DC metro area. The Brite Group provides IT Services and integrated solutions covering the entire E2E data life-cycle – Enterprise Data Architecture, Data Management, building Business Intelligence (BI) Reporting platforms, Analytics, Data Science, AI/ML and Cloud Migration. The company has a large number of cleared resources and technical experience in implementing solutions both on-premises and cloud environments.
The Brite Group encourages a Data scientist currently pursuing a Bachelor’s Degree with experience with AI/ML projects in school or with other companies to apply to this role.
Develops, deploys, and manages statistical, unsupervised, and supervised Learning Data Models with accuracy, precision, and sensitivity. Performs mathematical/statistician modeling to organize, optimize, model, research, mine, and predict new data. Identifies accuracies, precision, and sensitivities. Validates machine learning with statistical modeling.
Designs algorithms such as neural networks, decision trees, and surrogate models.
Uses Data Lakes, chunking, large file shares, or increased time limits for storage needs; uses serverless computing like Athena/Lambda to handle concurrency or increased time limits; and uses extract, transform, and load standard data formats, or parsing due to lack of infrastructure.
- 1+ years of experience in custom developing Machine Learning models (including supervised and unsupervised) and Natural Language Processing (NLP) expertise focusing on unstructured data.
- 1 + years of in-depth knowledge of at least one analytical/statistical language (Python, R).
- 1 + years of experience manipulating unstructured data from different platforms.
The ideal candidate must be residing in and around the university and his/her address must be found in a HUBZONE location that can be verified with the following map
https://maps.certify.sba.gov/hubzone/map#center=44.722800,-103.249700&zoom=4