Undergraduate Links
Data Science B.S. Learning Outcomes
LO1: Knowledge of foundational mathematical concepts relevant to data analysis, including
- calculus (integration and differentiation), both univariate and multivariate
- applied linear algebra (matrix operations, linear transformations, projections in Euclidean
space) - concepts in probability including univariate and multivariate distributions for both
discrete and continuous densities, random variables, independence and conditional
independence, expectation, limit theorems
LO2: Knowledge of basic principles in computer science, including
- basic concepts in Boolean algebra and logic
- basic concepts in computer hardware
- basic concepts in operating systems and programming languages
- basic concepts in data management
- basic concepts in algorithms and data structures
- basic concepts in computational complexity
LO3: Knowledge of foundational statistical concepts, including
- foundational statistical theory including parameter estimation, hypothesis testing,
decision theory, maximum likelihood, and Bayesian methods - statistical modeling principles, including linear and logistic regression models, nonparametric methods, diagnostic techniques
- design of statistical studies, including sampling methodologies, random assignment,
data collection, efficiency, and issues of bias, causality, confounding, and coincidence;
sample size and power calculations - exploratory data analysis methods including visualization, projection methods, clustering,
and density estimation
LO4: Knowledge of basic principles in statistical computing including
- ability to understand and program randomization and simulation techniques such as
bootstrap, Monte Carlo and cross-validation methods - ability to understand and program optimization methods such as gradient descent and
solving linear systems of equations - ability to understand and implement basic concepts and algorithms in machine learning,
such as predictive modeling for classification and regression - basic knowledge of issues in handling large-scale data, such as scalability, indexing,
data management, and distributed computing
LO5: Ability to take a real-world data analysis problem, formulate a conceptual approach to the
problem, match aspects of the problem to previously learned theoretical and methodological
tools, break down the solution into a step-by-step approach, and implement a working solution
in a modern software language, including
- knowledge of basic concepts in developing and testing software programs
- knowledge of basic concepts in software engineering skills and experience in developing
software in at least one modern programming language such as Python - basic skills in data management, including relational database systems and SQL
- basic skills in at least one statistical programming environment such as R
- basic concepts in data collection such as sampling methods, handling missing data, data
provenance, ethics in data collection - basic concepts in data privacy, e.g., legal aspects, institutional review boards,
algorithmic techniques for data anonymization
LO6: Ability to communicate effectively in data analysis projects, including
- effective technical writing and presentations
- teamwork and collaboration
- communicate results to both data analysis specialists and non-specialists accurately,
concisely, and effectively
We’re here to help. Please contact Venette Van Duyn to request services.