V2X Message Classification, Prioritization, and Spam Detection Dataset

Citation Author(s):
Jivthesh
M R
Amrita Vishwa Vidyapeetham
Sai Shibu
N B
Amrita Vishwa Vidyapeetham
Sethuraman
N Rao
Amrita Vishwa Vidyapeetham
Submitted by:
Sai Shibu
Last updated:
Tue, 05/23/2023 - 00:40
DOI:
10.21227/yhfd-0d92
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

The dataset contains a collection of V2X (Vehicle-to-Everything) messages for classification, prioritization, and spam message detection. It comprises 1,000 messages with varying message types, content, priorities, and spam labels. The messages are sourced from different vehicles with specific destination vehicles or broadcast to all vehicles. They cover various message types, including traffic updates, emergency alerts, weather notifications, hazard warnings, roadwork information, and spam messages. The priority of the messages is categorized as either high, medium, or low. High-priority messages typically involve emergencies or critical situations that require immediate attention. Medium-priority messages include traffic updates, roadwork notices, and hazard warnings. Low-priority messages consist of spam or promotional content. The dataset includes a spam label to indicate whether a message is classified as spam. Spam messages contain promotional offers, limited-time deals, or discounts, while non-spam messages convey important information related to traffic, emergencies, weather conditions, and roadwork. The dataset aims to facilitate developing and evaluating machine learning models for V2X message classification, prioritization, and spam message detection. It can be used for training and testing models to automatically analyze and process V2X messages for effective communication and decision-making in intelligent transportation systems.

Instructions: 

V2X Message Classification, Prioritization, and Spam Detection Dataset
=====================================================================

Introduction
------------
The V2X Message Classification, Prioritization, and Spam Detection Dataset is a collection of V2X (Vehicle-to-Everything) messages designed for research and development purposes. This dataset aims to facilitate developing and evaluating machine learning models for V2X message analysis, including classification, prioritization, and spam message detection.

Dataset Overview
----------------
The dataset consists of 1,000 V2X messages, each with various attributes that can be used for different tasks related to V2X communication. The dataset includes the following information for each message:

1. Message ID: A unique identifier for each message.
2. Source Vehicle: The vehicle from which the message originates.
3. Destination Vehicle: The message's intended destination vehicle(s).
4. Message Type: The type or category of the message, such as traffic, emergency, weather, hazard, roadwork, or spam.
5. Message Content: The actual content of the message.
6. Priority: The priority level of the message, categorized as high, medium, or low.
7. Spam: A binary label indicating whether the message is classified as spam (1) or not (0).

Usage and Applications
----------------------
The dataset can be used for a variety of research and development purposes, including:

1. V2X Message Classification: Train machine learning models to classify V2X messages into different categories based on their content and type.
2. Prioritization of V2X Messages: Develop models to prioritize V2X messages based on their urgency and importance.
3. Spam Detection in V2X Messages: Build models to identify and filter out spam messages from the V2X communication channel.
4. Intelligent Transportation Systems (ITS): Evaluate the performance of V2X

File Format
-----------
The dataset is provided in CSV (Comma-Separated Values) format for easy integration with various data processing and machine learning frameworks. Each row in the CSV file represents a single V2X message, and the columns correspond to the attributes mentioned above.

License
-------
The V2X Message Classification, Prioritization, and Spam Detection Dataset are released under the [Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/) license. Please review the license terms for detailed information on the permitted use of the dataset.

We hope this dataset serves as a valuable resource for researchers and practitioners working in V2X communication and intelligent transportation systems.