MEDS Class of 2023

Program Learning Outcome (PLO) #1 Assessment - Core Knowledge

Author

Sam Csik

Published

June 9, 2023

Summary

This post-program PLO assessment was administered to the MEDS class of 2023 (response rate = 29 / 31 students) on June 7, 2023.

Heads-up! No pre-program PLO assessment data for the Class of 2023

This assessment of MEDS PLO #1 (Core Knowledge) was developed by the MEDS Curriculum Cohesion Committee in winter/spring 2023, and therefore was not administered to the MEDS Class of 2023 before beginning the program in August 2022 (i.e. no pre-program assessment was conducted). No benchmark data is available for the Class of 2023.

The survey consists of 31 questions (28 multiple choice, 3 short free-response) and takes ~30 minutes to complete. Questions 1 - 15 ask respondents to rank how often they use certain data science tools or workflows, or familiarity/comfort levels with particular topics (see Part 1 through Part 4, below). Questions 16 - 31 assess respondents’ familiarity and application of domain-specific knowledge/tools taught during the MEDS program (see Part 5 onward, below). Many of these question types are multi-part and begin with a question phrased as:

  • “How familiar/comfortable are you with X” (rank 1 (never heard of it) > 5 (very familiar))
  • “Have often have you done/implemented Y” (rank 1 (never) > 5 (all the time))

If a respondent chooses a level of 2 or greater, they proceed to the remaining part(s) of the question to be tested on their knowledge/understanding of that topic. If a respondent chooses a level 1, they are skipped to the next question (to prevent respondents from guessing at answers). A perfect score is 14 points. See the distribution of scores, below:

Individual Questions

Note

Questions that have a correct answer are color-coded green.

Part 1: OS and data/document storage

NOTE: Percentages will not sum to 100%

Part 2: How often do you currently use the following?

Part 3: Workflow satisfaction

Part 4: Rank the following from 1 (strongly disagree) to 5 (strongly agree)

Part 5: Stats

All students answered all parts of question 17

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to answer all parts of question 17.

Below is a chunk of code showing a simple linear regression relating the number of pieces of microplastics to the number of days per year with rainfall.

Question 17b raw responses

Some respondents recorded their answers in sentence form, while others did not round their answers to the nearest integer. Cleaned responses are shown in the plot, above. Responses as they were recorded are included in the table, below:

All students answered all parts of question 18

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to answer all parts of question 18.

62% of respondents correctly answered question 18b (i.e. chose exactly the following options: normal, uniform, bimodal, symmedtric)

Part 6: Programming 1

All students advanced to answer question 19b

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to answer part b of question 19.

All students answered all parts of question 19

29 / 29 student respondents (100%) chose a comfort level of 2 or greater, and therefore were directed to part c of question 19.

The following code (in R) defines a function:

compute_turbine_power <- function(height, flowrate, efficiency, maxheight){
  
  if (height < maxheight) {
    
    power = height * flowrate * efficiency
    
  } else {
    
    power = maxheight * flowrate * efficiency
    
  }
  
  return(power)
  
}

This R code applies this function to data:

flowrate = 2
maxheight = 20
power_turbine_a <- compute_turbine_power(10, flowrate, 0.5, maxheight)

Part 7: Environmental Modeling

Part 8: Geospatial Analysis & Remote Sensing

All students answered all parts of question 21

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to answer all parts of question 21.

100% of respondents correctly answered question 21b (i.e. chose exactly the following options: raster, vector)

All students answered all parts of question 22

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to answer all parts of question 22.

28 / 29 respondents advanced to Question 23b

28 / 29 student respondents (96.5517241%) chose a familiarity level of 2 or greater – these respondents were directed to answer Question 23b. One respondent selected 1 (never worked with it before) in response to Question 23a and was jumped directly to Question 24.

27 / 29 respondents advanced to Question 24b

27 / 29 student respondents (93.1034483%) chose a familiarity level of 2 or greater – these respondents were directed to answer Question 24b. Two respondents selected 1 (never heard of it) in response to Question 24a and were jumped directly to Question 25.

Part 9: Machine Learning

All respondents advanced to Question 25b

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to Question 25b.

All respondents advanced to Question 25c

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to Question 25c.

All respondents advanced to Question 26b

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to Question 26b.

All respondents advanced to Question 26c

29 / 29 student respondents (100%) chose a familiarity level of 2 or greater, and therefore were directed to Question 26c.

75.9% of respondents correctly answered question 26c (i.e. chose exactly the following options: My model is likely to perform very well when applied to new data, My test set has data entry errors in it)

Part 10: Environmental Justice

Part 11: Data Viz & Communication

Identify 4 areas for improvement in the following data visualization that shows information about Michigan counties with highest college attendance.

Wordcloud of most frequently occurring words used to describe suggested improvements to the above data visualization (Question 30)
Question 30 raw responses

Free responses as they were recorded are included in the table, below:

Part 12: Programming 2

# define function
def convert_F_to_C(temp_F):
  temp_C = (temp_F-32)*5/9
  return temp_C

# use function
convert_F_to_C(32)


End MEDS Class of 2023 PLO Assessment Report


Return to main page