Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Proposal for a B.S. in Data Science at William & Mary: Curriculum and Tracks, Study notes of Mathematics

Data Science programsSTEM certificationAcademic majorsStudent Enrollment

The proposal for a Bachelor of Science (B.S.) in Data Science degree program at William & Mary. The program aims to provide students with a blend of skills from various fields, including mathematics, computer science, and information science, to efficiently conduct data analyses and communicate technical findings to diverse audiences. The document also discusses the increasing demand for such a degree and the unique advantages of William & Mary's program in comparison to other institutions.

What you will learn

  • Why is there a growing demand for a B.S. in Data Science degree?
  • What are the key differences between a B.S. in Data Science and related programs like CAMS and BUID at William & Mary?
  • What skills will students acquire through the proposed B.S. in Data Science program at William & Mary?

Typology: Study notes

2021/2022

Uploaded on 08/05/2022

jacqueline_nel
jacqueline_nel 🇧🇪

4.4

(229)

506 documents

1 / 7

Toggle sidebar

Related documents


Partial preview of the text

Download Proposal for a B.S. in Data Science at William & Mary: Curriculum and Tracks and more Study notes Mathematics in PDF only on Docsity! 1 Proposal for a Major in Data Science Executive Summary Data Science is the interdisciplinary study of methods used to gain knowledge from diverse sets of data, which are frequently quite large and multi-dimensional. Students trained in Data Science will study a blend of topics from many subdomains of communications, philosophy, mathematics, computer science, and information science. This combination allows Data Science students to efficiently conduct computational analyses within their own knowledge domain, manage teams of more specialized individuals to answer far-ranging questions, and communicate technical findings to diverse audiences. Data Science differs from applied mathematics, statistics, or computer and information science in that a data scientist has a breadth of experience across all of these fields but may not have as much knowledge as a specialist in any particular field. Demand for a major in Data Science is high. Since the inception of the Data Science program in the fall of 2017, student enrollment has steadily risen and currently, as of January 2019, the program has approximately 60 students enrolled (34 self-designed majors and 26 minors). To meet the ever-increasing student demand, the Data Science Program steering committee concludes that a structured B.S. major in Data Science needs to be offered at William & Mary during the 2019/2020 academic year. This document summarizes the necessary steps required to craft a degree program that the steering committee believes would be eligible for SCHEV-STEM certification, on the basis of on an analysis of peer and other institutions’ implementations of Data Science curricula. Justification for a Data Science Major at W&M To quote President Rowe in a recent article in the Washington Post: “College graduates must have the essential capacities to be agile, multidimensional and adaptable critical thinkers, prepared for the rapid pace of change we know lies ahead — and for what we cannot yet know. The speed of technological transformation is indisputable. And that speed places a premium on intellectual agility, multidimensionality and the ability to navigate change ethically and in a data-informed and values-driven way” (https://www.washingtonpost.com/education/2018/11/09/problem-solvers- tomorrows-workforce/?utm_term=.d2f483e69b4a). We concur with President Rowe, and argue that in the context of William & Mary, a major in Data Science is justified for three primary reasons. First, William &Mary is well positioned to lead the effort to train data scientists with a degree in the liberal arts, a unique and competitive niche that employers demand in response to the rapid increase in complex, data-driven problems. Second, there is clear student demand for a major. Third, a degree in Data Science is distinct from other degrees such as Computational and Applied Mathematics and Statistics (CAMS) or Business Data Analytics (BUAD). We elaborate on each of these points below. 2 William & Mary is poised to become a leader in training data scientists who have a background in the liberal arts; these are the exact kind of graduates that employers seek to meet growing demand for data scientists who can communicate effectively and think creatively about data- driven problems. As a liberal arts university, the Data Science program will integrate with all disciplines of Arts and Sciences and the Business School. Our comparative advantage--compared to other universities with data science programs--is our ability to train students with not only data science skills but also the communication and critical thinking skills needed to effectively define, discuss, and solve complicated problems related to the management, analysis, and dissemination of vast amounts of information. Second, there is clear demand among William & Mary students for a formalized major. At present, the Data Science program offers a minor and self-designed major in Data Science. Since the inception of Data Science in the fall semester of 2017, student enrollment has steadily increased. As of January 2019, approximately 34 students have declared a self-designed major and 26 a minor. Given the ever-increasing student demand, the Data Science Steering committee believes a structured B.S. in Data Science is needed to meet student demand and to more effectively and efficiently administer the Data Science program. Third, the proposed B.S. in Data Science differs from closely aligned programs in CAMS and BUAD in important ways. Requirements for majors in CAMS and BUAD only tangentially cover Data Science and do not include the breadth of courses required in a proposed B.S. in Data Science. As two illustrative examples, majors in CAMS’ Applied Statistics track, the major most closely related to the Data Science major, are required to take three courses in micro- and macro- economics and five courses in fundamental mathematical statistics. For electives, students take two courses in advanced economics, and two to three courses in advanced mathematics and computer science. The BBA in Business Analytics with Data Science requires students to take at least 18 credits in business fundamentals, such as accounting and business management, 12 credits in business analytics, and one elective course from a pool of six courses, with only one choice in Data Science (BUAD 460: Big Data Analytics). In contrast, the proposed B.S. in Data Science exposes students to a suite of courses in Data Science applications, including deliberation (considering the ethical, moral, and societal implications of data science) and communication, and courses that promote critical evaluation of how data can be used to solve novel problems. In particular, the proposed B.S. in Data Science requires students to take five courses with a focus on programing, modeling, machine learning, data visualization, database structures, and ethics in Data Science. Many of these courses are unique to the B.S. in Data Science, and not offered by any other programs or departments. Moreover, students will also take two courses in mathematical statistics and one in linear algebra. Last, again unique to the B.S. in Data Science, students are required to take three courses from one of the following tracks: algorithm track (Computer Science), spatial track (Geographical Information System), statistical mathematics track (Mathematics), or data application track (Interdisciplinary). The latter track will include new courses not offered in any other program/department at William & Mary. These courses, in part, will be designed by new TE/NTE faculty and will overlap with their areas of expertise in Data Science. 5 DATA 3XX - Databases and Structures: This is new course preparing students for common database programs found in data analytics (i.e., SQL; Hadoop-based infrastructure) and provides an applications-based education on theoretical data structures. A similar course (CSCI 241 – Data Structure) is currently offered at William & Mary, but lacks the focus on applications and theory necessary for data structures specific to Data Science (especially Hadoop-like infrastructure that focuses on exceptionally large datasets). Linear Algebra and Statistical Modeling (9 credit hours) Courses which provide students with core statistical knowledge, focusing on providing more breadth than DATA 146 – Reasoning under Uncertainty. MATH 211 – Linear Algebra: This an existing course offered by the Department of Mathematics. MATH 351 or (MATH 451 with additional prerequisites) - Probability and Statistics for Scientists: This an existing course offered by the Department of Mathematics. A statistical and modeling background is critical for students to be prepared to engage in Data Science problems. Although DATA 146 (Reasoning under Uncertainty) will cover the basic concepts of parametric modeling, MATH 351 provides much more depth to students in mathematical statistics. MATH 352 or (MATH 452 with additional prerequisites) - Statistical Data Analysis: This an existing course offered by the Department of Mathematics. The second component of mathematical statistics, again providing students with a more depth in statistical analyses. Substantive Specialty Courses (pick 3 courses, 9 credit hours) This pillar of the degree would focus on providing students with a substantive focus in data analytics. All such tracks would be built and approved in conjunction with the departments and programs that they might be housed within. A core data application track would also be offered for students that seek more depth within Data Science in particular. Examples of these tracks currently being pursued by students include a Spatial Data Track and an Algorithms Track. Other example tracks that are desired by students but would require negotiation with partner programs and departments include econometrics (Economics Track), survey design (Psychological Sciences/Government/Sociology Track), and bioinformatics (Biology Track). We propose that students chose three courses in one of three tracks listed in Table 1. By no means is the list of tracks exhaustive and we foresee additional tracks to be added in the near future. 6 Table 1 Examples of tracks currently pursued by Data Science students (Spatial Data and Algorithms Track) and a new track (Data Application track) to be implemented by the fall 2019. Some of the courses listed here would be new. Data Science affiliated faculty would be given the opportunity to design advanced elective courses on the basis of their interests, such as Bioinformatics, Data Driven Journalism, Computational Linguistics, Computational Geography etc. Spatial Data Track Data Application Track Algorithms Track GIS 201 (Intro. to GIS) DATA 400-level courses being taught in the specialty areas of CSCI 241 (Data Structures) GIS 400 (Remote Sensing) the new TE & NTE hires CSCI 243 (Discrete Structures of Computer Science) GIS 405 (Geovis. & Spatial Design) Examples: GIS 420 (Advanced GIS) GOVT 391 (GIS for Social Science) BIOL 445 (GIS for Biologists) DATA 440 (Data Driven Journalism) GOVT 391 (Topics in Government) LING 380 (Computational Methods in Language Science) Data Science for Humanities CSCI 303 (Algorithms) Senior Capstone (4 credit hours) DATA 4XX - Senior Capstone; COLL 400: This is a new course. We would offer three sections within an academic year, for students whose primary interests are in ALV, NQR, and CSI. These would be seminar-style courses where the focus is on each student developing a capstone project or portfolio that applies their Data Science skills to a substantive problem of their choice. This would be taught as a hybrid course that integrates elements of a seminar on a particular topic an honors colloquium, and a studio art senior capstone experience. To fulfill the COLL 400 requirement, students would present their research as either a presentation, poster, or video, to a diverse audience. 7 Trajectory for a B.S. in Data Science 1. In the first three semesters, a student should take DATA 141, DATA 146, and Calculus I & II, or MATH 211 if Calculus II was completed prior to enrolling at William & Mary. Ideally, a student would also take either/both the deliberation (currently DATA 150 or PHIL 215, 303, 312 or 3XX, a new course in Data Ethics) requirement or the visualization requirement (DATA 2XX). 2. In semesters four to six, a student will take DATA 301, DATA 3XX (Databases and Structures), complete the deliberation and visualization requirements, MATH 351, MATH352, and at least two of their three required Substantive Specialty Courses. 3. In a student’s final year, they would complete any missing requirements (due to study abroad, etc.) and take their capstone seminar. DATA 141 (Programming) Substantive Specialty Courses (Choose 3) DATA 146 (Modeling) DATA 2** (Visualization) DATA 301 (Applied ML) DATA 4** Capstone DATA 3** (Databases) DATA 2** (Deliberation & Ethics) MATH 351 MATH 352
Docsity logo



Copyright © 2024 Ladybird Srl - Via Leonardo da Vinci 16, 10126, Torino, Italy - VAT 10816460017 - All rights reserved