Synthetic data project framework

Fermé
Contact principal
Chromatic Data
Mississauga, Ontario, Canada
Il / Iel
Founder & CEO
(9)
6
Portails
(1)
Projet
Expérience académique ou travail rémunéré
60 heures de travail au total
Apprenant.e
N'importe où
Niveau Avancé

Portée du projet

Catégories
Analyse des données Modélisation des données Apprentissage automatique Intelligence artificielle Data science
Compétences
statistical programming statistical inference multivariate statistics nonparametric statistics data science ai/ml inference machine learning algorithms
Détails

Our company is interested in creating frameworks / templates for our pilot projects with social impact clients. Impact of this work is to maintain efficient operational execution in how we structure our client work.


We would like to collaborate with students to provide cohesive, appropriate details relative to the pilot project scaffolding from our Data Scientists. Students will write technical statistical documentation on methodologies and write code (likely in Python, using Jupyter and/or Marimo notebook) so we can achieve a fulsome pre-pilot understanding of the workflow involved.


This will involve several different steps for the students, including:

  • Working with Data Scientist guidance to write code that assesses initial data type, structure, etc, especially on determining appropriate analytical tooling for client data use case categories
  • Expanding on baseline Data Scientist framework document for synthetic data 'utility' metrics, i.e. how useful to a given statistical model the synthetic data is
  • Completing all above in analytical objectives including but not limited to dimension reduction, relationship analysis, and clustering
Livrables

By the end of the project, students should demonstrate:

  • Understanding of key statistical modeling processes particularly as across different types and structures of social impact data
  • Development of core applied math and computer science skills in statistics and programming AI/ML applications

Final deliverables should include:

  • Source materials such as the data platform's back-end code (automated tooling by use case), if applicable, and accompanying technical documentation (data assessments, utility metrics) guiding our future pilot project work
Mentorat
Expertise et connaissances du domaine

Fournir des connaissances spécialisées et approfondies et l'industrie générale des idées pour une compréhension globale.

Outils et/ou ressources

Donner accès aux outils, logiciels et ressources nécessaires pour la réalisation du projet.

Réunions régulières

Enregistrements programmés pour discuter des progrès, relever les défis et fournir des commentaires.

Causes prises en charge

Les défis mondiaux auxquels ce projet s'attaque, en accord avec les objectifs de développement durable (ODD) des Nations unies. En savoir plus sur les 17 ODD ici.

Industry, innovation and infrastructure

À propos de l'Compagnie

Compagnie
Mississauga, Ontario, Canada
0 - 1 employé.es
Business & management, It & computing, Non-profit, philanthropic & civil society, Technology, Trade & international business
Représentation
Entreprises appartenant à des minorités BIPOC 2slgbtqia+-owned Petite entreprise Sustainable/green
+ 2

We're solving the existential pain point of the social impact sectors - funding scarcity - with data creation, management, and analysis services.