Loading…
A reputation mechanism based Deep Reinforcement Learning and blockchain to suppress selfish node attack motivation in Vehicular Ad-Hoc Network
The selfish On-Board-Unit (OBU) attacks Vehicular Ad-Hoc Network (VANET) by various attacks for profit. However, many existing methods are based on the principle of direct reciprocity for communication, which makes it easy for large-scale networks to crash when an attack occurs. In order to reduce t...
Saved in:
Published in: | Future generation computer systems 2023-02, Vol.139, p.17-28 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The selfish On-Board-Unit (OBU) attacks Vehicular Ad-Hoc Network (VANET) by various attacks for profit. However, many existing methods are based on the principle of direct reciprocity for communication, which makes it easy for large-scale networks to crash when an attack occurs. In order to reduce the number of attackers in the vehicle ad hoc network and restrain the attack motivation of the OBUs, we propose an indirect reciprocal incentive mechanism based on reputation to encourage the OBUs in the VANET to help each other. Since most OBUs are in great need of network services, including potential attackers, when the loss of network services is far greater than the illegal benefits of their attacks, selfish and rational OBUs will give up attacks and take desirable behavior. In addition, to prevent some attacks from tampering with information, we also apply blockchain technology to record the behavior of OBU. The indirect reciprocity process of each OBU in VANET can be regarded as Markov Decision Process (MDP). In order to restrain the attack motivation of selfish nodes and communicate normally without knowing the attack model, an algorithm based on Deep Reinforcement Learning (DRL) is proposed to suppress attack motivation, so as to activate OBU learning in a dynamic environment and make wise decisions. Finally, through a large number of simulation experiments, the performance of our proposed algorithm is obviously better than that of the baseline strategy, and is verified by the simulation results.
•Indirect reciprocity security framework to compute and get the reputation of OBU.•Propose DSAM to learn the process of behavior selection and propagation of OBU.•Discussed the attack rate of the network with and without attack classification.•Blockchain technology to prevent malicious modification of the reputation mechanism. |
---|---|
ISSN: | 0167-739X 1872-7115 |
DOI: | 10.1016/j.future.2022.09.010 |