Introduction
The Hand and Glove Segmentation Dataset for Department of Energy (DOE) Glovebox Environments (HAGS) is a robot-allocentric perception dataset that aims to improve safety and accuracy in human-robot collaboration (HRC), particularly in glovebox environments. It captures two HRC experiments, building a Jenga block tower and disassembling a small box, under varied conditions, providing a diverse, reproducible, and comprehensive representation of interactions for robust and generalizable studies in human-robot interaction. The dataset supports the advancement of real-time safety systems and robotic assistance, fostering the development of intelligent, reliable solutions for human-robot collaboration.
Dataset Characteristics
As mentioned, the dataset captures two human-robot collaboration experiments. In the first experiment, participants built a Jenga block tower, receiving six blocks from the robot manipulator arm. In the second experiment, participants disassembled a box, with the robot manipulator arm handing them different screwdrivers to remove four screws. Each participant repeated both experiments four times, once per combination of two conditions: a) gloved or ungloved hands, and b) with or without a green screen placed along the bottom of the glovebox. Lastly, each run was recorded from two distinct camera angles: a top view and a side view.
Dataset Contents
The dataset contains:
- Ten participants, each conducting two experiments under four condition combinations and two camera angles, for a total of 16 videos per participant (see the sketch following this list).
- Eight hours of video footage of each experiment.
- 2876 annotated in-distribution and out-of-distribution frames.
- 1438 original, unannotated sampled frames.
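As a quick illustration, the 16 videos per participant follow from crossing the two experiments, two glove states, two background conditions, and two camera angles. The labels below are hypothetical; the actual file-naming convention is documented in the Dataset Report.

```python
# Illustrative only: hypothetical condition labels showing how the
# 16 recordings per participant arise from the experimental design.
from itertools import product

experiments = ["jenga_tower", "box_disassembly"]
glove_states = ["gloved", "ungloved"]
backgrounds = ["green_screen", "no_green_screen"]
camera_angles = ["top", "side"]

recordings = list(product(experiments, glove_states, backgrounds, camera_angles))
print(len(recordings))  # 2 * 2 * 2 * 2 = 16 videos per participant
```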
Data Collection
The data was collected in a standard glovebox commonly used by researchers in the DOE. Each experiment run was recorded from two camera angles: a bird's-eye view captured by a 1080p GoPro, and a side view captured by a 1080p Intel RealSense Development Kit camera positioned to the participant's right. A Universal Robots UR3e robot manipulator arm, equipped with a gripper for object handling, was pre-programmed to perform the two tasks and assist the participants. Two researchers supported each session: one operated the robot arm, and the other helped with object placement. Frames were then sampled from each video and annotated.
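One way to sample a fixed number of evenly spaced frames from a recording is sketched below with OpenCV; this is an assumption about the sampling procedure, not the authors' exact pipeline.

```python
# A minimal sketch of evenly spaced frame sampling from one video.
import cv2

def sample_frames(video_path: str, n_frames: int = 30):
    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    # Evenly spaced frame indices across the whole video.
    indices = [round(i * (total - 1) / (n_frames - 1)) for i in range(n_frames)]
    frames = []
    for idx in indices:
        cap.set(cv2.CAP_PROP_POS_FRAMES, idx)
        ok, frame = cap.read()
        if ok:
            frames.append(frame)
    cap.release()
    return frames
```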
Data Post-Processing for Machine Learning
For machine learning applications, the sampled frames were split into two sets: a) an in-distribution set and b) an out-of-distribution set. The in-distribution set contains the scenarios most likely to occur during human-robot collaboration work in a glovebox, making it suitable for model training. Accordingly, videos without a green screen in the background and with participants wearing gloves were designated in-distribution. The remaining videos, in which the participant is ungloved and/or a green screen is placed in the background, depict scenarios less likely to occur in a glovebox setting, so their frames were placed in the out-of-distribution set, making them suitable for model evaluation. In total, 1440 frames were sampled for labeling, distributed equally across the videos: 120 in-distribution frames and 24 out-of-distribution frames per participant (except Participant 6; see the Data Quality Statement in the Dataset Report for an explanation of this exception).
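A minimal sketch of the split rule described above, assuming per-video condition flags (the dataset's own metadata files record the actual conditions):

```python
def assign_split(gloved: bool, green_screen: bool) -> str:
    """Gloved hands with no green screen is the realistic glovebox
    scenario, so those frames form the in-distribution set."""
    if gloved and not green_screen:
        return "in_distribution"   # used for model training
    return "out_of_distribution"   # used for model evaluation
```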
Data Annotation
The data was manually annotated by four researchers divided into two groups. Three classes were assigned in each image: left hand, right hand, and background. Annotators were instructed to annotate each hand from the fingertips to the wrist and to provide their best estimate of the wrist location when the subject was wearing gloves. To generate an initial annotation for each frame, the annotator supplied the open-source Segment Anything Model with a box or point prompt, working within the open-source Label Studio annotation tool; the annotator then adjusted each annotation for precision. Two annotators labeled each image to promote inter-annotator agreement. Each frame's annotation was converted to a single PNG file recording the three classes: left hand, right hand, and background.
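For illustration, below is a minimal sketch of the prompting step using the public segment-anything API with a locally downloaded checkpoint; this is an approximation of the workflow described above, not the authors' exact annotation pipeline.

```python
# A minimal sketch: prompt SAM with an annotator-drawn box to get an
# initial hand mask, which is then refined manually in Label Studio.
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

# Checkpoint path is a placeholder; download from the SAM release page.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

def initial_hand_mask(image: np.ndarray, box_xyxy: np.ndarray) -> np.ndarray:
    """image: HxWx3 uint8 RGB frame; box_xyxy: annotator box [x0, y0, x1, y1]."""
    predictor.set_image(image)
    masks, _, _ = predictor.predict(box=box_xyxy, multimask_output=False)
    return masks[0]  # boolean HxW mask, adjusted by the annotator afterwards
```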
Models and Data Usage
Multiple semantic segmentation models were trained using the annotated dataset. The official model training code and configurations for this dataset are available on GitHub; the link to the repository is provided in the Software metadata field below.
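To help users get started, here is a minimal sketch of loading frames and their PNG annotations for segmentation training. The directory layout and class-index encoding (0 = background, 1 = left hand, 2 = right hand) are assumptions; consult the Dataset Report and metadata for the actual structure.

```python
# A minimal PyTorch Dataset sketch for image + single-channel PNG masks.
from pathlib import Path

import numpy as np
import torch
from PIL import Image
from torch.utils.data import Dataset

class HagsSegmentation(Dataset):
    # Assumes masks share filenames with images and use integer class indices.
    def __init__(self, image_dir: str, mask_dir: str):
        self.images = sorted(Path(image_dir).glob("*.png"))
        self.mask_dir = Path(mask_dir)

    def __len__(self):
        return len(self.images)

    def __getitem__(self, i):
        img_path = self.images[i]
        image = np.array(Image.open(img_path).convert("RGB"))
        mask = np.array(Image.open(self.mask_dir / img_path.name))
        x = torch.from_numpy(image).permute(2, 0, 1).float() / 255.0  # CxHxW
        y = torch.from_numpy(mask.astype(np.int64))                   # HxW labels
        return x, y
```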
Human Subjects
This study was approved by the University of Texas at Austin Institutional Review Board (IRB) under IRB ID STUDY00003948. Everyone present in the recorded data consented to the observation of their behavior. To provide a comprehensive representation of collaborative scenarios, a diverse pool of participants was selected. To protect their privacy, participants with recognizable features were asked to cover them, for example with makeup. The only characteristic identified by the dataset is race. Any participant who revoked consent was removed from the data and the annotations. Included in this data package are the IRB exempt determination and the Research Information Sheet distributed to the participants.
Dataset Organization
It is recommended that users first inspect the metadata under the metadata directory to understand which files suit their task. For an in-depth explanation of the dataset file structure, refer to the Dataset Report included in this dataset, and use the "tree" view for the metadata to better observe the dataset structure.
Dataset Quality Statement
The research team maintains data quality by adhering to standardized protocols during experimentation, ensuring consistency and reproducibility in participant procedures. Participant adherence to these protocols was monitored throughout each session, including in the videos and pictures captured. Inter-annotator agreement is established during the annotation process, with each image labeled by two individuals to improve accuracy and reliability. Comprehensive documentation is maintained throughout data collection to ensure traceability and facilitate auditing, and all dataset contents are documented for transparency and reproducibility. Finally, known dataset noise is reported in the Dataset Report to flag potential anomalies or discrepancies within the data.
Bulk Data Download
A script named download_data.py is provided for bulk data download. To run it, save the script, navigate to its directory in a terminal, and execute it with the python command, ensuring all required dependencies are installed. A stable internet connection is recommended, as downloading all files from the Dataverse repository takes some time.
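For reference, the sketch below illustrates the kind of bulk download download_data.py performs, using the public Dataverse native API. The server URL and dataset DOI are placeholders, not the actual HAGS values, and this is not necessarily the script's exact implementation.

```python
# A minimal sketch of bulk download via the Dataverse native API.
import os

import requests

SERVER_URL = "https://dataverse.example.edu"  # placeholder host
DATASET_DOI = "doi:10.XXXX/XXXXX"             # placeholder DOI

def download_all(out_dir: str = "hags_data") -> None:
    os.makedirs(out_dir, exist_ok=True)
    # List every file in the latest published version of the dataset.
    resp = requests.get(
        f"{SERVER_URL}/api/datasets/:persistentId",
        params={"persistentId": DATASET_DOI},
        timeout=60,
    )
    resp.raise_for_status()
    for entry in resp.json()["data"]["latestVersion"]["files"]:
        data_file = entry["dataFile"]
        file_id, name = data_file["id"], data_file["filename"]
        # Stream each file to disk to avoid holding it all in memory.
        with requests.get(
            f"{SERVER_URL}/api/access/datafile/{file_id}",
            stream=True,
            timeout=60,
        ) as r:
            r.raise_for_status()
            with open(os.path.join(out_dir, name), "wb") as f:
                for chunk in r.iter_content(chunk_size=1 << 20):
                    f.write(chunk)
        print(f"Downloaded {name}")

if __name__ == "__main__":
    download_all()
```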