5 May 2023
Moderator: John Goodhue
Presenters: Cogan Shimizu
Bio: Cogan Shimizu is an Assistant Professor at Wright State University. His work focuses on pattern-based methods in knowledge engineering, especially targeting schema development methodologies, and the curation of effective KE curricula. He has interest in both foundational research in advancing the state of the art, as well as technical implementation and deployment of knowledge graphs, at scale. Recently, he's worked with the KnowWhereGraph team to create the world's largest open-source knowledge graph (12 billion triples- released May 6th), as well as with the Knowledge Graph Conference to launch the KG Open Curriculum.
Presenters: Cogan Shimizu
Bio: Cogan Shimizu is an Assistant Professor at Wright State University. His work focuses on pattern-based methods in knowledge engineering, especially targeting schema development methodologies, and the curation of effective KE curricula. He has interest in both foundational research in advancing the state of the art, as well as technical implementation and deployment of knowledge graphs, at scale. Recently, he's worked with the KnowWhereGraph team to create the world's largest open-source knowledge graph (12 billion triples- released May 6th), as well as with the Knowledge Graph Conference to launch the KG Open Curriculum.
- 3 participants
- 41 minutes
9 Feb 2022
Presentation: November 5th, 2021
Moderator: Florence Hudson
Presenters: Jay Yang, Richard Biever, and Inna Kouper
Bios:
Jay Yang: S. Jay Yang is currently a Professor in the Department of Computer Engineering and Director of Global Outreach for Global Cybersecurity Institute at Rochester Institute of Technology. Supported by NSF, NSA, IARPA, DARPA, AFRL, ONR, and ARL, his research team has developed several pioneering machine learning, attack modeling, and simulation systems to enhance cyber situational awareness and enable anticipatory cyber defense. His earlier works included FuSIA, VTAC, ViSAw, F-VLMM, CASCADES, CAPTURE, and attack obfuscation modeling. More recently, his team has developed ASSERT, HeAT-PATRL, and CLEAR-ROAD. He was a NSF Trusted CI Open Science Fellow in 2019 and TTP Fellow in 2020, and received IEEE Region 1 Outstanding Teaching in an IEEE Area of Interest Award – for outstanding leadership and contributions to cybersecurity and computer engineering education
Richard Biever: Richard Biever is Duke University's chief information security officer and a senior director in the Office of Information Technology. He has served in previous roles with the Georgia Institute of Technology's Office of Information Technology and Hewlett Packard. Under Richard’s leadership, Duke University has developed a strong, collaborative office working with university departments, research faculty, other higher education partners to develop and enable effective cybersecurity defenses at Duke. STINGAR, a Duke-developed and NSF-funded threat intelligence and sharing initiative, is currently being deployed beyond Duke with a number of higher education partners. More information on STINGAR may be found at https://stingar.security.duke.edu.
Inna Kouper: Inna Kouper is Director of Researcher Engagement for the ResearchSOC and a Research Scientist at the Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington. She studies emerging technologies and data practices and works on projects that advance open science and data sharing. She is involved in several large-scale projects funded by the National Science Foundation and the Institute of Museum and Library Services to study communities of data producers and consumers, promote data stewardship and develop innovative methods of data collection, analysis, and preservation. Kouper has undergraduate and masters degrees in information systems, a PhD in sociology and a PhD in information science.
Moderator: Florence Hudson
Presenters: Jay Yang, Richard Biever, and Inna Kouper
Bios:
Jay Yang: S. Jay Yang is currently a Professor in the Department of Computer Engineering and Director of Global Outreach for Global Cybersecurity Institute at Rochester Institute of Technology. Supported by NSF, NSA, IARPA, DARPA, AFRL, ONR, and ARL, his research team has developed several pioneering machine learning, attack modeling, and simulation systems to enhance cyber situational awareness and enable anticipatory cyber defense. His earlier works included FuSIA, VTAC, ViSAw, F-VLMM, CASCADES, CAPTURE, and attack obfuscation modeling. More recently, his team has developed ASSERT, HeAT-PATRL, and CLEAR-ROAD. He was a NSF Trusted CI Open Science Fellow in 2019 and TTP Fellow in 2020, and received IEEE Region 1 Outstanding Teaching in an IEEE Area of Interest Award – for outstanding leadership and contributions to cybersecurity and computer engineering education
Richard Biever: Richard Biever is Duke University's chief information security officer and a senior director in the Office of Information Technology. He has served in previous roles with the Georgia Institute of Technology's Office of Information Technology and Hewlett Packard. Under Richard’s leadership, Duke University has developed a strong, collaborative office working with university departments, research faculty, other higher education partners to develop and enable effective cybersecurity defenses at Duke. STINGAR, a Duke-developed and NSF-funded threat intelligence and sharing initiative, is currently being deployed beyond Duke with a number of higher education partners. More information on STINGAR may be found at https://stingar.security.duke.edu.
Inna Kouper: Inna Kouper is Director of Researcher Engagement for the ResearchSOC and a Research Scientist at the Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington. She studies emerging technologies and data practices and works on projects that advance open science and data sharing. She is involved in several large-scale projects funded by the National Science Foundation and the Institute of Museum and Library Services to study communities of data producers and consumers, promote data stewardship and develop innovative methods of data collection, analysis, and preservation. Kouper has undergraduate and masters degrees in information systems, a PhD in sociology and a PhD in information science.
- 7 participants
- 54 minutes
3 Dec 2021
Presentation: December 3rd, 2021
Big Data Hubs Data Sharing and Cyberinfrastructure WG
Moderator: Melissa Cragin
Presenters: Kenton McHenry and Amanda Charbonneau
Title: EarthCube GeoCODES & Operationalizing FAIR at the Common Fund (Data Ecosystem)
Bios:
Kenton McHenry is the Associate Director for Software at the National Center for Supercomputing Applications (NCSA) at the University of Illinois at Urbana-Champaign. For the past 12 years Kenton has helped establish a team of Research Software Engineers to support the scientific community broadly in the software development, data management, and data analytics needs present in current research activities. Kenton has further served to drive the advancement of sustainable and reusable research cyberinfrastructure serving as a principal or co-principal investigator on efforts such as DIBBs Brown Dog to prototype a data transformation service, CSSI Clowder to support flexible data management leveraging machine learning, the Open Storage Network to pilot a national data fabric, PEcAn to drive infrastructure in support of ecological forecasting, and EarthCube piloting out activities to elevate software as full scholarly, peer reviewed, published objects.
Amanda Charbonneau is the Inreach and Outreach Coordinator for the Common Fund Data Ecosystem Coordination Center, an NIH initiative to increase access and use of NIH Common Fund datasets. She has an eclectic scientific background and has variously worked on topics such as population genetics, spectrometry, entomology, genomics, and building tank armor. She uses these experiences to train scientists to use new computational tools and data types, as well as to build and manage communities. Her interests include statistics, coding, video games, and occasionally tweeting at https://twitter.com/procrastinomics
Big Data Hubs Data Sharing and Cyberinfrastructure WG
Moderator: Melissa Cragin
Presenters: Kenton McHenry and Amanda Charbonneau
Title: EarthCube GeoCODES & Operationalizing FAIR at the Common Fund (Data Ecosystem)
Bios:
Kenton McHenry is the Associate Director for Software at the National Center for Supercomputing Applications (NCSA) at the University of Illinois at Urbana-Champaign. For the past 12 years Kenton has helped establish a team of Research Software Engineers to support the scientific community broadly in the software development, data management, and data analytics needs present in current research activities. Kenton has further served to drive the advancement of sustainable and reusable research cyberinfrastructure serving as a principal or co-principal investigator on efforts such as DIBBs Brown Dog to prototype a data transformation service, CSSI Clowder to support flexible data management leveraging machine learning, the Open Storage Network to pilot a national data fabric, PEcAn to drive infrastructure in support of ecological forecasting, and EarthCube piloting out activities to elevate software as full scholarly, peer reviewed, published objects.
Amanda Charbonneau is the Inreach and Outreach Coordinator for the Common Fund Data Ecosystem Coordination Center, an NIH initiative to increase access and use of NIH Common Fund datasets. She has an eclectic scientific background and has variously worked on topics such as population genetics, spectrometry, entomology, genomics, and building tank armor. She uses these experiences to train scientists to use new computational tools and data types, as well as to build and manage communities. Her interests include statistics, coding, video games, and occasionally tweeting at https://twitter.com/procrastinomics
- 5 participants
- 54 minutes
4 Nov 2021
November 4th 2021
Presenter: Ryan Kenny
Ryan Kenny. Ryan Kenny is a Lieutenant Colonel in the United States Army, serving in the Signal Corps. He has served in the 82nd Airborne Division, Special Operations Community, and 516th Signal Brigade. He has three combat tours in Afghanistan. He received a BA in cognitive psychology from the University of Notre Dame, in 2003, a MA in national security and strategic studies from the U.S. Naval War College, Newport, RI in 2015, and is pursuing his PhD in Engineering and Public Policy at Carnegie Mellon University, Pittsburgh, PA. His research interests include Human-machine Systems, Artificial Intelligence, and Behavioral Decision Making.
Presenter: Ryan Kenny
Ryan Kenny. Ryan Kenny is a Lieutenant Colonel in the United States Army, serving in the Signal Corps. He has served in the 82nd Airborne Division, Special Operations Community, and 516th Signal Brigade. He has three combat tours in Afghanistan. He received a BA in cognitive psychology from the University of Notre Dame, in 2003, a MA in national security and strategic studies from the U.S. Naval War College, Newport, RI in 2015, and is pursuing his PhD in Engineering and Public Policy at Carnegie Mellon University, Pittsburgh, PA. His research interests include Human-machine Systems, Artificial Intelligence, and Behavioral Decision Making.
- 2 participants
- 54 minutes
1 Oct 2021
October 1st, 2021
Moderator: Niall Gaffney
Presenters: Shawn McKee, Henry Neeman, and Scott Yockel
Bios:
Shawn McKee is a Research Scientist at the University of Michigan Physics Department and Founding Director of the Center for Network and Storage-Enabled Collaborative Computational Science (CNSECCS), part of the Michigan Institute for Computational Discovery and Engineering (MICDE). He also directs the ATLAS Great Lakes Tier-2 (AGLT2) Center for the ATLAS Experiment at the CERN Large Hadron Collider. Since 2001, Dr. McKee has been the Network Project Manager for US ATLAS, planning for and developing the necessary network environment to support the US ATLAS computing model. He also co-leads the USATLAS facilities and distributed computing area and the ATLAS Distributed Data Management area. Since June 2012 he has led the Open Science Grid (OSG) Network area. He is the Principal Investigator on the “NSF CC*DNI DIBBs: Multi-Institutional Open Storage Research InfraStructure (MI-OSiRIS)” project and co-Principal Investigator on the”NSFCIF21 DIBBs: EI: SLATE and the Mobility of Capability” and “CC* Integration: NetBASILISK: NETwork Border At Scale Integrating and Leveraging Individual Security Components” projects.
Dr. Henry Neeman is the Director of the OU Supercomputing Center for Education & Research, Executive Director for Research Computing, Associate Professor in the College of Engineering and Adjunct Associate Professor in the School of Computer Science at the University of Oklahoma. He and Dana Brunson have been appointed joint co-leads of the XSEDE Campus Engagement program, which includes the Campus Champions. He received his BS in computer science and his BA in statistics with a minor in mathematics from the State University of New York at Buffalo in 1987, his MS in CS from the University of Illinois at Urbana-Champaign in 1990 and his PhD in CS from UIUC in 1996. Prior to coming to OU, Dr. Neeman was a postdoctoral research associate at the National Center for Supercomputing Applications at UIUC, and before that served as a graduate research assistant both at NCSA and at the Center for Supercomputing Research & Development. In addition to his own teaching and research, Dr. Neeman has collaborated with dozens of research groups, applying High Performance Computing techniques in fields such as numerical weather prediction, bioinformatics and genomics, data mining, high energy physics, astronomy, nanotechnology, petroleum reservoir management, river basin modeling and engineering optimization. He serves as an ad hoc advisor to student researchers in many of these fields. Dr. Neeman's research interests include high performance computing, scientific computing, parallel and distributed computing and computer science education.
Moderator: Niall Gaffney
Presenters: Shawn McKee, Henry Neeman, and Scott Yockel
Bios:
Shawn McKee is a Research Scientist at the University of Michigan Physics Department and Founding Director of the Center for Network and Storage-Enabled Collaborative Computational Science (CNSECCS), part of the Michigan Institute for Computational Discovery and Engineering (MICDE). He also directs the ATLAS Great Lakes Tier-2 (AGLT2) Center for the ATLAS Experiment at the CERN Large Hadron Collider. Since 2001, Dr. McKee has been the Network Project Manager for US ATLAS, planning for and developing the necessary network environment to support the US ATLAS computing model. He also co-leads the USATLAS facilities and distributed computing area and the ATLAS Distributed Data Management area. Since June 2012 he has led the Open Science Grid (OSG) Network area. He is the Principal Investigator on the “NSF CC*DNI DIBBs: Multi-Institutional Open Storage Research InfraStructure (MI-OSiRIS)” project and co-Principal Investigator on the”NSFCIF21 DIBBs: EI: SLATE and the Mobility of Capability” and “CC* Integration: NetBASILISK: NETwork Border At Scale Integrating and Leveraging Individual Security Components” projects.
Dr. Henry Neeman is the Director of the OU Supercomputing Center for Education & Research, Executive Director for Research Computing, Associate Professor in the College of Engineering and Adjunct Associate Professor in the School of Computer Science at the University of Oklahoma. He and Dana Brunson have been appointed joint co-leads of the XSEDE Campus Engagement program, which includes the Campus Champions. He received his BS in computer science and his BA in statistics with a minor in mathematics from the State University of New York at Buffalo in 1987, his MS in CS from the University of Illinois at Urbana-Champaign in 1990 and his PhD in CS from UIUC in 1996. Prior to coming to OU, Dr. Neeman was a postdoctoral research associate at the National Center for Supercomputing Applications at UIUC, and before that served as a graduate research assistant both at NCSA and at the Center for Supercomputing Research & Development. In addition to his own teaching and research, Dr. Neeman has collaborated with dozens of research groups, applying High Performance Computing techniques in fields such as numerical weather prediction, bioinformatics and genomics, data mining, high energy physics, astronomy, nanotechnology, petroleum reservoir management, river basin modeling and engineering optimization. He serves as an ad hoc advisor to student researchers in many of these fields. Dr. Neeman's research interests include high performance computing, scientific computing, parallel and distributed computing and computer science education.
- 4 participants
- 36 minutes
20 Apr 2021
April 12th 2021: Data Science and Cyberinfrastructure Workshop
Presenter: Sagar Samtani, Kathy Benninger, Amarnath Gupta, and Jelena Mirkovic
Title: Cybersecurity as Big Data Science: An Interactive Workshop
Abstract/Description:
Cybersecurity Practitioners
• Overwhelmed by large volume, variety and fast-evolving data that are necessary to do their jobs
(intrusion detection, threat intelligence, bot detection, disinformation, malware evasion, etc.)
• Interested in learning and adopting AI/ML/Data Science tools but not sure how to begin.
Researchers working on AI/ML/Data Science for Cybersecurity
• Has limited (mostly unrealistic) data to work with.
• Faces challenges to bring research advances to practices.
Meanwhile, there might be data science techniques being overlooked.
So let’s bring the community together to talk about:
• Cybersecurity use-cases that could benefit from data science tools and approaches.
• Technical and non-technical challenges to treat cybersecurity as big data science.
• Plausible solutions and best practices to apply data science for cybersecurity.
• Training, education, and engagement strategies.
This INTERACTIVE workshop is meant for all of use to share thoughts!
• 73 in Cybersecurity
• 26 in Data Science
• 11 in HPC
• 7 in Other scientific domain areas
• 7 in Business Operations
• many “others” or did not answered
• From 11 countries
• Please use the poll to select reasons that you are here.
Use the following ways to share your thoughts:
• Google Doc (link in Zoom Chat)
• Zoom Chat
• Raise your ”virtual” hand in Zoom and we will call you to speak.
Panelists:
• Sagar Samtani, Indiana University
• Kathy Benninger, Pittsburgh Supercomputing Center
• Amarnath Gupta, San Diego Supercomputer Center, University of California San Diego
• Jelena Mirkovic, Information Sciences Institute, University of Southern California
• and YOU!
Topics:
• Use-cases
• Challenges
• Solutions/Best Practices
• Engagement/Training/Education
We will begin by hearing briefly from the panelists for use-cases and challenges.
Presenter: Sagar Samtani, Kathy Benninger, Amarnath Gupta, and Jelena Mirkovic
Title: Cybersecurity as Big Data Science: An Interactive Workshop
Abstract/Description:
Cybersecurity Practitioners
• Overwhelmed by large volume, variety and fast-evolving data that are necessary to do their jobs
(intrusion detection, threat intelligence, bot detection, disinformation, malware evasion, etc.)
• Interested in learning and adopting AI/ML/Data Science tools but not sure how to begin.
Researchers working on AI/ML/Data Science for Cybersecurity
• Has limited (mostly unrealistic) data to work with.
• Faces challenges to bring research advances to practices.
Meanwhile, there might be data science techniques being overlooked.
So let’s bring the community together to talk about:
• Cybersecurity use-cases that could benefit from data science tools and approaches.
• Technical and non-technical challenges to treat cybersecurity as big data science.
• Plausible solutions and best practices to apply data science for cybersecurity.
• Training, education, and engagement strategies.
This INTERACTIVE workshop is meant for all of use to share thoughts!
• 73 in Cybersecurity
• 26 in Data Science
• 11 in HPC
• 7 in Other scientific domain areas
• 7 in Business Operations
• many “others” or did not answered
• From 11 countries
• Please use the poll to select reasons that you are here.
Use the following ways to share your thoughts:
• Google Doc (link in Zoom Chat)
• Zoom Chat
• Raise your ”virtual” hand in Zoom and we will call you to speak.
Panelists:
• Sagar Samtani, Indiana University
• Kathy Benninger, Pittsburgh Supercomputing Center
• Amarnath Gupta, San Diego Supercomputer Center, University of California San Diego
• Jelena Mirkovic, Information Sciences Institute, University of Southern California
• and YOU!
Topics:
• Use-cases
• Challenges
• Solutions/Best Practices
• Engagement/Training/Education
We will begin by hearing briefly from the panelists for use-cases and challenges.
- 10 participants
- 1:23 hours
6 Nov 2020
Cindy Bruyere: Cindy Bruyere is a scientist at NCAR (National Center for Atmospheric Research). She is the director for NCAR’s Capacity Canter for Climate and Weather Extremes. A group that conducts actionable science to advance our understanding of weather and climate extremes on time scales from sub-seasonal to multi-decadal. Her background is in meteorology and environmental management.
Mike Daniels: Mike Daniels’ career has been dedicated to the leadership and development of complex computing, software engineering, data management and cyberinfrastructure systems for the geosciences. His recent projects range from executive leadership and governance, real-time data acquisition and access, development of science workflows, data management, and linking data artifacts to research by way of semantic architectures.
Date: 11/06/20
Presenter: Cindy Bruyere & Mike Daniels
Institution: UCAR
West Big Data Hub
Mike Daniels: Mike Daniels’ career has been dedicated to the leadership and development of complex computing, software engineering, data management and cyberinfrastructure systems for the geosciences. His recent projects range from executive leadership and governance, real-time data acquisition and access, development of science workflows, data management, and linking data artifacts to research by way of semantic architectures.
Date: 11/06/20
Presenter: Cindy Bruyere & Mike Daniels
Institution: UCAR
West Big Data Hub
- 7 participants
- 60 minutes
2 Oct 2020
Jim is a senior research scientist in the cybersecurity group at NCSA, where he leads the CILogon and SciTokens projects, which support identity and access management for research collaborations. Jim is also the deputy director of Trusted CI, and he is the chair of the Trustworthy Data Working Group. Jim received his Ph.D. in computer sciences from the University of Wisconsin-Madison, where he worked on the HTCondor project.
Date: 10/02/20
Presenter: Jim Basney
Institution: National Center for Supercomputing Applications
Date: 10/02/20
Presenter: Jim Basney
Institution: National Center for Supercomputing Applications
- 6 participants
- 55 minutes
4 Sep 2020
Geoffrey Charles Fox (http://www.dsc.soic.indiana.edu/, gcf@indiana.edu)
Fox received a Ph.D. in Theoretical Physics from Cambridge University, where he was Senior Wrangler. He is now a distinguished professor of Engineering, Computing, and Physics at Indiana University, where he is the director of the Digital Science Center. He previously held positions at Caltech, Syracuse University, and Florida State University after being a postdoc at the Institute for Advanced Study at Princeton, Lawrence Berkeley Laboratory, and Peterhouse College Cambridge. He has supervised the Ph.D. of 73 students and published around 1500 papers (over 540 with at least ten citations) in physics and computing with a hindex of 82 and over 38500 citations. He received the High-Performance Parallel and Distributed Computing (HPDC) Achievement Award and the ACM - IEEE CS Ken Kennedy Award for Foundational contributions to parallel computing in 2019. He is a Fellow of APS (Physics) and ACM (Computing) and works on the interdisciplinary interface between computing and applications. He is involved in several projects to enhance the capabilities of Minority Serving Institutions. He has experience in online education and its use in MOOCs for areas like Data and Computational Science. He is active in the Industry consortium MLPerf.
Date: 09/04/20
Presenter: Geoffrey Fox
Institution: Indiana University
Midwest Big Data Hub
Fox received a Ph.D. in Theoretical Physics from Cambridge University, where he was Senior Wrangler. He is now a distinguished professor of Engineering, Computing, and Physics at Indiana University, where he is the director of the Digital Science Center. He previously held positions at Caltech, Syracuse University, and Florida State University after being a postdoc at the Institute for Advanced Study at Princeton, Lawrence Berkeley Laboratory, and Peterhouse College Cambridge. He has supervised the Ph.D. of 73 students and published around 1500 papers (over 540 with at least ten citations) in physics and computing with a hindex of 82 and over 38500 citations. He received the High-Performance Parallel and Distributed Computing (HPDC) Achievement Award and the ACM - IEEE CS Ken Kennedy Award for Foundational contributions to parallel computing in 2019. He is a Fellow of APS (Physics) and ACM (Computing) and works on the interdisciplinary interface between computing and applications. He is involved in several projects to enhance the capabilities of Minority Serving Institutions. He has experience in online education and its use in MOOCs for areas like Data and Computational Science. He is active in the Industry consortium MLPerf.
Date: 09/04/20
Presenter: Geoffrey Fox
Institution: Indiana University
Midwest Big Data Hub
- 7 participants
- 53 minutes
5 Jun 2020
WG: DSCI
Date: 06/05/20
Presenter 1: Peter Rose & Ilya Zaslavsky
Institution: San Diego Supercomputer Center
Title: COVID-19-Net – using knowledge graphs to make sense of heterogeneous COVID datasets
West Big Data Hub
Presenter 2: Florence Hudson
Institution: Columbia University
Title: COVID Information Commons
Northeast Big Data Hub
Presenter 3: Christine Kirkpatrick
Institution: San Diego Supercomputer Center
Title: Virus Outbreak Data Network, with a spotlight on VODAN Africa
West Big Data Hub
Date: 06/05/20
Presenter 1: Peter Rose & Ilya Zaslavsky
Institution: San Diego Supercomputer Center
Title: COVID-19-Net – using knowledge graphs to make sense of heterogeneous COVID datasets
West Big Data Hub
Presenter 2: Florence Hudson
Institution: Columbia University
Title: COVID Information Commons
Northeast Big Data Hub
Presenter 3: Christine Kirkpatrick
Institution: San Diego Supercomputer Center
Title: Virus Outbreak Data Network, with a spotlight on VODAN Africa
West Big Data Hub
- 8 participants
- 1:01 hours
3 Apr 2020
Persistent Identifiers are commonly used for long term identification of publications (DOIs), published data sets (DataCite), and even people (ORCIDs). However, PIDs have could have more utility throughout the data lifecycle. ERPID and the FDOF are looking at ways to track workflow and provenance information with PIDs that could enable universal data interoperability and full reproducibility of computational workflows.
Date: 04/03/20
Presenter: Rob Quick
Institution: Indiana University
Midwest Big Data Innovation Hub
Date: 04/03/20
Presenter: Rob Quick
Institution: Indiana University
Midwest Big Data Innovation Hub
- 8 participants
- 1:18 hours
6 Mar 2020
Part 1: Data Confidentiality and Privacy Needs in Scientific Computing
Trusted CI scientific researchers must participate in conversations about the challenges that they face in dealing with data confidentiality and privacy constraints in their work. Our eventual aim is to produce documentation that will help the community work toward "right sized" security and privacy solutions that enable scientific progress, at acceptable costs to usability. To accomplish this, we are seeking conversations with a diverse range of scientists to best understand the ways in which security and privacy issues hinder obtaining the data that is needed to conduct certain types of scientific research, and also to understand those researchers' scientific computing workflows in sufficient detail to best work toward what acceptable solutions to address those challenges might look like in the future.
Date: 03/06/20
Presenter: Sean Peisert
Institution: Lawrence Berkley National Laboratory
West Big Data Innovation Hub
Part 2: A Brief Update from the Trustworthy Data Working Group
Jim Basney and Melissa Cragin provide a brief update from the Trustworthy Data Working Group, a collaboration between Trusted CI, the BDHubs, and others to survey science projects to determine the spectrum of data security concerns and practices in the scientific community and to provide guidance on data security for open science, to improve scientific productivity and trust in scientific results. The working group is drafting the survey now, and we welcome your participation in the group and your assistance with obtaining survey results from a broad and diverse group of science projects.
Date: 03/06/20
Presenter: Jim Basney & Melissa Cragin
Institution: National Center for Supercomputing Applications & San Diego Supercomputer Center
Midwest & West Big Data Innovation Hubs
Trusted CI scientific researchers must participate in conversations about the challenges that they face in dealing with data confidentiality and privacy constraints in their work. Our eventual aim is to produce documentation that will help the community work toward "right sized" security and privacy solutions that enable scientific progress, at acceptable costs to usability. To accomplish this, we are seeking conversations with a diverse range of scientists to best understand the ways in which security and privacy issues hinder obtaining the data that is needed to conduct certain types of scientific research, and also to understand those researchers' scientific computing workflows in sufficient detail to best work toward what acceptable solutions to address those challenges might look like in the future.
Date: 03/06/20
Presenter: Sean Peisert
Institution: Lawrence Berkley National Laboratory
West Big Data Innovation Hub
Part 2: A Brief Update from the Trustworthy Data Working Group
Jim Basney and Melissa Cragin provide a brief update from the Trustworthy Data Working Group, a collaboration between Trusted CI, the BDHubs, and others to survey science projects to determine the spectrum of data security concerns and practices in the scientific community and to provide guidance on data security for open science, to improve scientific productivity and trust in scientific results. The working group is drafting the survey now, and we welcome your participation in the group and your assistance with obtaining survey results from a broad and diverse group of science projects.
Date: 03/06/20
Presenter: Jim Basney & Melissa Cragin
Institution: National Center for Supercomputing Applications & San Diego Supercomputer Center
Midwest & West Big Data Innovation Hubs
- 7 participants
- 49 minutes
7 Feb 2020
ImPACT (Infrastructure for Privacy-Assured CompuTations) is an NSF funded BIGDATA project in its 3rd year of execution, bringing together experts from UNC and Duke to address some of the hard problems encountered by researchers, data providers and institutions when working on protected data. While focusing primarily on supporting analysis of PII data in social sciences as the primary use domain, tools developed for ImPACT are broadly applicable to other domains and types of data. This presentation describes what the project has been able to accomplish so far.
Date: 2/7/20
Presenter: Laura Christopherson
Institution: RENCI
South Big Data Hub
Date: 2/7/20
Presenter: Laura Christopherson
Institution: RENCI
South Big Data Hub
- 10 participants
- 55 minutes
1 Nov 2019
Arvados: An open source platform for storing, organizing, processing, and sharing genomic and other big data.
Date: 11/1/2019
Presenter: Tom Morris
Institution: Veritas Genetics
South Big Data Hub
Date: 11/1/2019
Presenter: Tom Morris
Institution: Veritas Genetics
South Big Data Hub
- 3 participants
- 31 minutes
4 Oct 2019
Date: 10/4/2019
Presenter: Von Welch
Institution: Indiana University
Midwest Big Data Hub
Presenter: Von Welch
Institution: Indiana University
Midwest Big Data Hub
- 3 participants
- 23 minutes
20 Jun 2019
Date: 11/2/2019
Presenter: Jim Wilgenbusch
Institution: Minnesota Supercomputing Institute
Midwest Big Data Hub
Presenter: Jim Wilgenbusch
Institution: Minnesota Supercomputing Institute
Midwest Big Data Hub
- 1 participant
- 17 minutes
7 Jun 2019
Date: 6/7/2019
Presenter: Jim Wilgenbusch
Institution: Minnesota Supercomputing Institute
Midwest Big Data Hub
Presenter: Jim Wilgenbusch
Institution: Minnesota Supercomputing Institute
Midwest Big Data Hub
- 2 participants
- 32 minutes
3 May 2019
Date: 5/3/2019
Presenter: Valentin Pentchev
Institution: Indiana University
Midwest Big Data Hub
Presenter: Valentin Pentchev
Institution: Indiana University
Midwest Big Data Hub
- 3 participants
- 28 minutes
5 Apr 2019
Date: 4/5/2019
Presenters: Claudio Caimi and Mirko Manea
Institution: HP Enterprise (Italy)
Presenters: Claudio Caimi and Mirko Manea
Institution: HP Enterprise (Italy)
- 5 participants
- 26 minutes
1 Mar 2019
Date: 3/1/2019
Presenter: Ray Idaszak
Institution: RENCI
South Big Data Hub
Presenter: Ray Idaszak
Institution: RENCI
South Big Data Hub
- 2 participants
- 30 minutes
1 Feb 2019
Date: 2/1/2019
Presenter: Shane Glass
Institution: Google
West Big Data Hub
Presenter: Shane Glass
Institution: Google
West Big Data Hub
- 3 participants
- 57 minutes
11 Jan 2019
Date: 1/11/2019
Presenter: Commissioner Monica Bharel
Institution: Massachusetts Department of Public Health
Northeast Big Data Hub
Presenter: Commissioner Monica Bharel
Institution: Massachusetts Department of Public Health
Northeast Big Data Hub
- 3 participants
- 38 minutes
2 Nov 2018
Date: 11/2/2018
Presenter: Ben Blaiszik
Institution: University of Chicago
Midwest Big Data Hub
Presenter: Ben Blaiszik
Institution: University of Chicago
Midwest Big Data Hub
- 2 participants
- 14 minutes
5 Oct 2018
Date: 10/5/2018
Presenter: Matthew Lange
Institution: University of California, Davis
West Big Data Hub
Presenter: Matthew Lange
Institution: University of California, Davis
West Big Data Hub
- 2 participants
- 26 minutes
5 Oct 2018
Date: 10/5/2018
Presenter: Gabe Youtsey
Institution: University of California Agriculture and Natural Resources
West Big Data Hub
Presenter: Gabe Youtsey
Institution: University of California Agriculture and Natural Resources
West Big Data Hub
- 3 participants
- 25 minutes
7 Sep 2018
Date: 9/7/2018
Presenter: Patrick McGarry
Institution: data.world
South Big Data Hub
Presenter: Patrick McGarry
Institution: data.world
South Big Data Hub
- 4 participants
- 27 minutes
1 Jun 2018
Date: 6/1/2018
Presenter: Natalia Ruiz Juri
Institution: The University of Texas at Austin
South Big Data Hub
Presenter: Natalia Ruiz Juri
Institution: The University of Texas at Austin
South Big Data Hub
- 2 participants
- 19 minutes
1 Jun 2018
Date: 6/1/2018
Presenter: Craig Willis
Institution: National Center for Supercomputing Applications
Midwest Big Data Hub
Presenter: Craig Willis
Institution: National Center for Supercomputing Applications
Midwest Big Data Hub
- 3 participants
- 15 minutes
4 May 2018
Date: 05/04/18
Presenter: Jen Duthie
Institution: City of Austin, TX
South Big Data Hub
Presenter: Jen Duthie
Institution: City of Austin, TX
South Big Data Hub
- 2 participants
- 21 minutes
4 May 2018
Date: 5/4/18
Presenter: Rachel Bain
Institution: Massachusetts Department of Transportation
Northeast Big Data Hub
Presenter: Rachel Bain
Institution: Massachusetts Department of Transportation
Northeast Big Data Hub
- 3 participants
- 27 minutes
6 Apr 2018
Date: 04/06/18
Presenter: Dave Tarboton
Institution: Utah State University
West Big Data Innovation Hub
Presenter: Dave Tarboton
Institution: Utah State University
West Big Data Innovation Hub
- 2 participants
- 26 minutes
6 Apr 2018
Date: 4/6/18
Presenter: Mike Conway
Institution: National Institute of Environmental Health Sciences (NIEHS)
South Big Data Hub
Presenter: Mike Conway
Institution: National Institute of Environmental Health Sciences (NIEHS)
South Big Data Hub
- 5 participants
- 26 minutes
2 Feb 2018
Date: 02/02/18
Presenter: Kelly Gaither
Institution: Texas Advanced Computing Center
South Big Data Hub
Presenter: Kelly Gaither
Institution: Texas Advanced Computing Center
South Big Data Hub
- 3 participants
- 25 minutes
2 Feb 2018
Date: 2/2/18
Presenter: Jacob Bor
Institution: Boston University
Northeast Big Data Hub
Presenter: Jacob Bor
Institution: Boston University
Northeast Big Data Hub
- 4 participants
- 26 minutes
5 Jan 2018
Date: 1/5/18
Presenter: Ellen Rathje
Institution: UT Austin
South Big Data Hub
Presenter: Ellen Rathje
Institution: UT Austin
South Big Data Hub
- 4 participants
- 21 minutes
5 Jan 2018
Date: 1/5/18
Presenter: Jason Coposky
Institution: Renaissance Computing Institute
South Big Data Hub
Presenter: Jason Coposky
Institution: Renaissance Computing Institute
South Big Data Hub
- 3 participants
- 17 minutes
6 Nov 2017
Date;: 11/6/2017
Presenter: Christine Kirkpatrick (SDSC) & Niall Gaffney (TACC)
Institutions: San Diego Supercomputer Center; Texas Advanced Computing Center
West Big Data Hub
Presenter: Christine Kirkpatrick (SDSC) & Niall Gaffney (TACC)
Institutions: San Diego Supercomputer Center; Texas Advanced Computing Center
West Big Data Hub
- 5 participants
- 29 minutes
28 Apr 2017
Date: 04/28/17
Presenter: Vas Vasiliadis
Institution: University of Chicago
Midwest Big Data Hub
Presenter: Vas Vasiliadis
Institution: University of Chicago
Midwest Big Data Hub
- 2 participants
- 29 minutes
28 Apr 2017
(***)
Date: 04/28/17
Presenters: Vani Mandava & Jeff Prosise
Institution: Microsoft
West Big Data Hub
Date: 04/28/17
Presenters: Vani Mandava & Jeff Prosise
Institution: Microsoft
West Big Data Hub
- 4 participants
- 35 minutes
31 Mar 2017
Date: 03/31/17
Presenters: Florence Hudson & John Moore
Institution: Internet2
Midwest Big Data Hub
Presenters: Florence Hudson & John Moore
Institution: Internet2
Midwest Big Data Hub
- 3 participants
- 38 minutes
31 Mar 2017
Date: 03/31/17
Presenters: Claris Castillo (RENCI) & Alex Feltus (Clemson)
Institutions: Renaissance Computing Institute & Clemson University
South Big Data Hub
Presenters: Claris Castillo (RENCI) & Alex Feltus (Clemson)
Institutions: Renaissance Computing Institute & Clemson University
South Big Data Hub
- 7 participants
- 48 minutes
2 Mar 2017
Date: 03/02/17
Presenter: Gustavo Durand
Institution: Harvard University
Northeast Big Data Hub
Presenter: Gustavo Durand
Institution: Harvard University
Northeast Big Data Hub
- 3 participants
- 20 minutes
2 Mar 2017
Date: 03/02/17
Presenter: Chris Navarro & Jong Lee
Institution: National Center for Supercomputing Applications (NCSA)
Midwest Big Data Hub
Presenter: Chris Navarro & Jong Lee
Institution: National Center for Supercomputing Applications (NCSA)
Midwest Big Data Hub
- 5 participants
- 19 minutes
17 Feb 2017
Date: 02/17/17
Presenter: Jennifer Hammock
Institution: Smithsonian Institution
South Big Data Hub
Presenter: Jennifer Hammock
Institution: Smithsonian Institution
South Big Data Hub
- 3 participants
- 29 minutes
17 Feb 2017
Date: 02/17/17
Presenter: Matt Spitzer
Institution: Center for Open Science
South Big Data Hub
Presenter: Matt Spitzer
Institution: Center for Open Science
South Big Data Hub
- 3 participants
- 29 minutes
3 Feb 2017
Date: 02/03/17
Presenter: Hao Xu
Institution: Renaissance Computing Institute (RENCI)
South Big Data Hub
Presenter: Hao Xu
Institution: Renaissance Computing Institute (RENCI)
South Big Data Hub
- 2 participants
- 25 minutes
3 Feb 2017
Date: 02/03/17
Presenter: Chun-kun "Amos" Wang
Institution: UNC-Chapel Hill
South Big Data Hub
Presenter: Chun-kun "Amos" Wang
Institution: UNC-Chapel Hill
South Big Data Hub
- 3 participants
- 18 minutes
20 Jan 2017
Date: 1/20/17
Presenter: Mike Conway
Institution: Renaissance Computing Institute (RENCI)
South Big Data Hub
Presenter: Mike Conway
Institution: Renaissance Computing Institute (RENCI)
South Big Data Hub
- 2 participants
- 19 minutes
20 Jan 2017
Date: 01-20-17
Presenter: Kenton McHenry
Institution: National Center for Supercomputing Applications (NCSA)
Midwest Big Data Hub
Presenter: Kenton McHenry
Institution: National Center for Supercomputing Applications (NCSA)
Midwest Big Data Hub
- 2 participants
- 18 minutes
20 Jan 2017
Date: 1/20/17
Presenter: Russ Clark
Institution: Georgia Tech U
South Big Data Hub
Presenter: Russ Clark
Institution: Georgia Tech U
South Big Data Hub
- 2 participants
- 27 minutes
11 Nov 2016
Date: 11/11/2016
Presenter: Kenton McHenry
Institution: National Center for Supercomputing Applications (NCSA)
Midwest Big Data Hub
Presenter: Kenton McHenry
Institution: National Center for Supercomputing Applications (NCSA)
Midwest Big Data Hub
- 1 participant
- 31 minutes
11 Nov 2016
Date: 11/11/2016
Presenter: Carol Song
Institution: Purdue University
Midwest Big Data Hub
Presenter: Carol Song
Institution: Purdue University
Midwest Big Data Hub
- 2 participants
- 28 minutes
11 Nov 2016
Date: 11/11/16
Presenter: Jane Greenberg
Institution: Drexel University
Northeast Big Data Hub
Presenter: Jane Greenberg
Institution: Drexel University
Northeast Big Data Hub
- 3 participants
- 26 minutes