Skip to main content

Data Space - Research

 

  • The PI is developing a laptop environment for students to use in studying big data analytics. Master students from the Data Lab have been testing and learning to use this environment.
  • Applications: consulted with Sam Woolford (MA, Center for Quantitative Analysis) for a local business using data mining techniques on the HPC; assisting the Public Sector Analytics Project to work with a large dataset in an unfamiliar format
  • Learned with Tumen Bayar (ATC), Norm Josephy (MA), Maria Skaletsky (ATC) and Jason Wells (ATC) to use the R Language libraries which support parallel processing and large datasets on the HPC
  • Built the high performance computing cluster (HPC) with Jason Wells (ATC) and Steve Morrow (Systems Services)
  • Installed Cassandra, a modern distributed database, and Spark, a computing environment for big data analytics, on the HPC