Computer Vision

MobileNetV3 with CBAM for Bamboo Stick Counting

The experimental data in this paper come from bamboo sticks provided by farmers who sell bamboo in Anji. We randomly grabbed fewer than 100 bamboo sticks and bundled them together. Pictures were taken at heights of 5 cm, 10 cm, 15 cm, and 20 cm, from the front and at left and right inclinations. Clear, usable images were then selected and labeled with the labelimg software. In total, 600 sparse bamboo stick samples were collected.
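As a rough illustration of how box labels can yield a stick count, the sketch below counts the labeled objects in one annotation file. It assumes the labels were exported in labelimg's Pascal VOC XML format, which the description does not confirm; the file name, the `bamboo` class name, and the sample boxes are hypothetical.

```python
import xml.etree.ElementTree as ET

def count_objects(voc_xml, label=None):
    """Count <object> entries in a Pascal VOC annotation string.

    If label is given, only objects whose <name> equals it are counted.
    """
    root = ET.fromstring(voc_xml)
    names = [obj.findtext("name") for obj in root.iter("object")]
    if label is None:
        return len(names)
    return sum(1 for name in names if name == label)

# Hypothetical annotation with three labeled sticks (not real dataset content).
SAMPLE = """<annotation>
  <filename>bundle_001.jpg</filename>
  <object><name>bamboo</name>
    <bndbox><xmin>10</xmin><ymin>12</ymin><xmax>30</xmax><ymax>32</ymax></bndbox>
  </object>
  <object><name>bamboo</name>
    <bndbox><xmin>35</xmin><ymin>14</ymin><xmax>55</xmax><ymax>34</ymax></bndbox>
  </object>
  <object><name>bamboo</name>
    <bndbox><xmin>60</xmin><ymin>11</ymin><xmax>80</xmax><ymax>31</ymax></bndbox>
  </object>
</annotation>"""

stick_count = count_objects(SAMPLE, label="bamboo")
```

The ground-truth count for an image is simply the number of labeled boxes, so a detector's output can be compared against it directly.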

Categories: Image Processing
Computer Vision

533 Views

Dataset of human gaze, environmental point clouds and RGB images during indoor locomotion

This is the data for the paper "Fusion of Human Gaze and Machine Vision for Predicting Intended Locomotion Mode", published in IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2022.

Categories: Machine Learning
Wearable Sensing
Computer Vision

234 Views

StairNet: A Computer Vision Dataset for Stair Recognition

Visual perception can improve transitions between different locomotion mode controllers (e.g., level-ground walking to stair ascent) by sensing the walking environment prior to physical interactions. Here we developed the "StairNet" dataset to support the development of vision-based stair recognition systems. The dataset builds on ExoNet – the largest open-source dataset of egocentric images of real-world walking environments.

Categories: Artificial Intelligence
Machine Learning
Wearable Sensing
Computer Vision

2015 Views

BVI-Lowlight-Images

One of the weak points of most denoising algorithms, especially deep learning based ones, is the training data. Because little or no ground-truth data is available, these algorithms are often evaluated using synthetic noise models such as additive zero-mean Gaussian noise. The downside of this approach is that such simple models do not represent the noise present in natural imagery.
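For reference, the synthetic model mentioned above can be sketched in a few lines. This is a minimal illustration of additive zero-mean Gaussian noise on a grayscale image stored as rows of 0-255 integers; the function name, the `sigma` value, and the flat test image are assumptions for illustration and are not part of the dataset.

```python
import random

def add_gaussian_noise(image, sigma, seed=None):
    """Return a noisy copy of a grayscale image given as rows of 0-255 ints.

    Each pixel gets an independent sample from N(0, sigma^2) added to it,
    and the result is rounded and clipped back to the 0-255 range.
    """
    rng = random.Random(seed)
    noisy = []
    for row in image:
        noisy_row = []
        for pixel in row:
            value = pixel + rng.gauss(0.0, sigma)
            noisy_row.append(min(255, max(0, round(value))))
        noisy.append(noisy_row)
    return noisy

# A flat mid-gray 4x4 test image (hypothetical data, not from the dataset).
clean = [[128] * 4 for _ in range(4)]
noisy = add_gaussian_noise(clean, sigma=10.0, seed=0)
```

The noise here is independent of the pixel value and identical across the image, which is the simplification the description criticizes: it is not a faithful model of the noise in natural low-light imagery.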

Categories: Digital signal processing
Image Processing
Computer Vision

483 Views

Retail Gaze: Gaze Estimation in Retail Environment

Retail Gaze is a dataset for remote gaze estimation in real-world retail environments. It is composed of 3,922 images of individuals looking at products in a retail environment, captured from 12 camera angles.

Each image captures a third-person view of the customer and the shelves. The location of the gaze point, the bounding box of the person's head, and segmentation masks of the gazed-at product areas are provided as annotations.

Categories: Artificial Intelligence
Machine Learning
Social Sciences
Computer Vision

555 Views

LCMFD (LIGHT COVID MASKED FACE DATASETS)

A dataset with more comprehensive category labels, richer data scenes, and more diverse image sizes was constructed. All images have been labeled, for a total of 8,232 annotations. The dataset is openly accessible to researchers for rapid deployment of mask detection subtasks during the COVID-19 outbreak and in possible future scenarios.

Categories: Computer Vision

217 Views

Silhouette-based 3D Human Pose Estimation Using a Single Wrist-mounted 360° Camera

In this paper, we propose a framework for 3D human pose estimation using a single 360° camera mounted on the user's wrist. Perceiving a 3D human pose with such a simple setup has remarkable potential for various applications (e.g., daily-living activity monitoring, motion analysis for sports training). However, no existing method has tackled this task due to the difficulty of estimating a human pose from a single camera image in which only a part of the human body is captured, and because of a lack of training data.

Categories: Machine Learning
Image Processing
Computer Vision

242 Views

SciBank: A Large Dataset of Annotated Scientific Paper Regions for Document Layout Analysis

Document layout analysis (DLA) plays an important role in identifying and classifying the different regions of digital documents in the context of Document Understanding tasks. In light of this, SciBank seeks to provide a considerable amount of data covering text (abstract, text blocks, caption, keywords, reference, section, subsection, title), tables, figures, and equations (isolated equations and inline equations) from 74,435 scientific article pages. Human curators validated that these 12 regions were properly labeled.

Categories: Machine Learning
Image Processing
Computer Vision

1440 Views

Application Research of Dynamic Deformation Monitoring using Smart Phone Camera: A Collection of Images

This is a collection of images.

Categories: Sensors
Image Processing
Computer Vision
Geoscience and Remote Sensing

125 Views

Dataset for Machine Learning-Based Classification of White Blood Cells of the Juvenile Visayan Warty Pig

This dataset was prepared to aid in the creation of a machine learning algorithm that would classify the white blood cells in thin blood smears of juvenile Visayan warty pigs. The creation of this dataset was deemed imperative because few blood smear images from this critically endangered species are available on the internet. The dataset contains 3,457 images of various types of white blood cells (JPEG) with accompanying cell type labels (XLSX).

Categories: Artificial Intelligence
Machine Learning
Image Processing
Biomedical and Health Sciences
Computer Vision
Ecology
Climate Change/Environmental

2621 Views
