Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm

Liangshun Wu; Peilin Liu; Junsuo Qu; Cong Zhang; Bin Zhang

doi:doi:10.11648/j.ijssn.20241201.11

Research Article |

| Peer-Reviewed

Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm

Liangshun Wu, Peilin Liu, Junsuo Qu, Cong Zhang, Bin Zhang^*

Published in International Journal of Sensors and Sensor Networks (Volume 12, Issue 1)

Received: 18 January 2024 Accepted: 29 January 2024 Published: 28 February 2024

Views: Downloads:

Download PDF

Share This Article

Twitter
Linked In
Facebook

Abstract

This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs.

Published in	International Journal of Sensors and Sensor Networks (Volume 12, Issue 1)
DOI	10.11648/j.ijssn.20241201.11
Page(s)	1-12
Creative Commons	This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.
Copyright	Copyright © The Author(s), 2024. Published by Science Publishing Group

Keywords

Multi-Agent Systems, Deep Reinforcement Learning, MADDPG, Wireless Sensor Networks, Energy Efficiency, Event Detection, Exploratory Strategy

References

[1]	Begum S, Wang S, Krishnamachari B, Helmy A. ELECTION: Energy-efficient and low-latency scheduling technique for wireless sensor networks. Proceedings of the 29th Annual IEEE Conference on Local Computer Networks (LCN). Tampa, FL, USA: IEEE; 2004. p. 16-18. https://orcid.org/10.1109/LCN.2004.49
[2]	Dantu R, Abbas K, O’Neill M II, Mikler A. Data centric modeling of environmental sensor networks. Proceedings of the IEEE Globecom. Dallas, TX, USA: IEEE, 2004: 447-452. https://orcid.org/10.1109/GLOCOMW.2004.1417621
[3]	Yang Y R, Lam S S. General AIMD congestion control. Proceedings of the 2000 International Conference on Network Protocols, 2000: 187-198. https://orcid.org/10.1109/ICNP.2000.896303
[4]	E. Ucer, M. C. Kisacikoglu and M. Yuksel. Analysis of Decentralized AIMD-based EV Charging Control. 2019 IEEE Power & Energy Society General Meeting (PESGM), Atlanta, GA, USA. 2019: 1-5. https://orcid.org/10.1109/PESGM40551.2019.8973725
[5]	Lee J, Lee D, Kim J, Cho W, Pajak J. A dynamic sensing cycle decision scheme for energy efficiency and data reliability in wireless sensor networks. Lecture Notes in Computer Science. 2007; 4681: 218-229. https://orcid.org/10.1007/978-3-540-74171-8_22
[6]	Jain A, Chang E Y. Adaptive sampling for sensor networks. Proceedings of the International Workshop on Data Management for Sensor Networks. Toronto, ON, Canada; 2004, 72 p. 10-16. https://orcid.org/10.1145/1052199.1052202
[7]	Stone P, Veloso M M. Team-partitioned, Opaque-transition Reinforcement Learning. Proceedings of the third annual conference on Autonomous Agents; 1999.
[8]	Foerster J, Nardelli N, Farquhar G, Afouras T, Torr P H S, Kohli P, Whiteson S. Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning. ARXIV-CS.AI; 2017.
[9]	Nguyen T T, Nguyen N D, Nahavandi S. Deep Reinforcement Learning For Multi-Agent Systems: A Review Of Challenges, Solutions And Applications. ARXIV-CS.LG; 2018.
[10]	Samvelyan M, Rashid T, Schroeder de Witt C, Farquhar G, Nardelli N, Rudner T G J, Hung C M, Torr P H S, Foerster J, Whiteson S. The StarCraft Multi-Agent Challenge. ARXIV-CS.LG; 2019.
[11]	Chu T, Wang J, Codecà L, Li Z. Multi-Agent Deep Reinforcement Learning For Large-scale Traffic Signal Control [J]. ARXIV-CS.LG, 2019.
[12]	Lowe R, Wu Y I, Tamar A, Harb J, Abbeel P, Mordatch I. Multi-agent actor-critic for mixed cooperative-competitive environments. Advances in neural information processing systems. 2017, 30.
[13]	Wang D, Zhang W, Song B, Du X, Guizani M. Market-Based Model In CR-WSN: A Q-Probabilistic Multi-agent Learning Approach. ARXIV-CS.MA; 2019.
[14]	Zhang K, Yang Z, Başar T. Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms. ARXIV-CS.LG; 2019.
[15]	Wang C, Shen X, Wang H, Xie W, Zhang H, Mei H. Multi-Agent Reinforcement Learning-Based Routing Protocol for Underwater Wireless Sensor Networks with Value of Information. IEEE Sensors Journal. 2023. https://orcid.org/10.1109/JSEN.2023.3345947
[16]	Ren J, et al. MeFi: Mean Field Reinforcement Learning for Cooperative Routing in Wireless Sensor Network. IEEE Internet of Things Journal. 2024 Jan 1; 11(1): 995-1011. https://orcid.org/10.1109/JIOT.2023.3289888.
[17]	Liang Y, Wu H, Wang H. Asynchronous Multi-Agent Reinforcement Learning for Collaborative Partial Charging in Wireless Rechargeable Sensor Networks. IEEE Transactions on Mobile Computing. 2023. https://orcid.org/10.1109/TMC.2023.3299238
[18]	Cao J, Liu J, Dou J, Hu C, Cheng J, Wang S. Multi-Agent Reinforcement Learning Charging Scheme for Underwater Rechargeable Sensor Networks. IEEE Communications Letters. 2023. https://orcid.org/10.1109/LCOMM.2023.3345362
[19]	Zhao M, Wang G, Fu Q, Guo X, Chen Y, Li T, Liu X. MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm. Frontiers in Neurorobotics. 2023; 17. https://orcid.org/10.3389/fnbot.2023.1243174
[20]	Liu X, Tan Y. Feudal Latent Space Exploration for Coordinated Multi-Agent Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems. 2023 Oct; 34(10): 7775-7783. https://orcid.org/10.1109/TNNLS.2022.3146201

Cite This Article

Plain Text BibTeX RIS

APA Style

Wu, L., Liu, P., Qu, J., Zhang, C., Zhang, B. (2024). Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. International Journal of Sensors and Sensor Networks, 12(1), 1-12. https://doi.org/10.11648/j.ijssn.20241201.11

Copy | Download

ACS Style

Wu, L.; Liu, P.; Qu, J.; Zhang, C.; Zhang, B. Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. Int. J. Sens. Sens. Netw. 2024, 12(1), 1-12. doi: 10.11648/j.ijssn.20241201.11

Copy | Download

AMA Style

Wu L, Liu P, Qu J, Zhang C, Zhang B. Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. Int J Sens Sens Netw. 2024;12(1):1-12. doi: 10.11648/j.ijssn.20241201.11

Copy | Download

@article{10.11648/j.ijssn.20241201.11,
  author = {Liangshun Wu and Peilin Liu and Junsuo Qu and Cong Zhang and Bin Zhang},
  title = {Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm},
  journal = {International Journal of Sensors and Sensor Networks},
  volume = {12},
  number = {1},
  pages = {1-12},
  doi = {10.11648/j.ijssn.20241201.11},
  url = {https://doi.org/10.11648/j.ijssn.20241201.11},
  eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijssn.20241201.11},
  abstract = {This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs.
},
 year = {2024}
}

Copy | Download

TY  - JOUR
T1  - Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm
AU  - Liangshun Wu
AU  - Peilin Liu
AU  - Junsuo Qu
AU  - Cong Zhang
AU  - Bin Zhang
Y1  - 2024/02/28
PY  - 2024
N1  - https://doi.org/10.11648/j.ijssn.20241201.11
DO  - 10.11648/j.ijssn.20241201.11
T2  - International Journal of Sensors and Sensor Networks
JF  - International Journal of Sensors and Sensor Networks
JO  - International Journal of Sensors and Sensor Networks
SP  - 1
EP  - 12
PB  - Science Publishing Group
SN  - 2329-1788
UR  - https://doi.org/10.11648/j.ijssn.20241201.11
AB  - This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs.

VL  - 12
IS  - 1
ER  -

Copy | Download

Author Information

Liangshun Wu

School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China
Peilin Liu

School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China
Junsuo Qu

Xi'an Key Laboratory of Advanced Control and Intelligent Process, School of Automation, Xi’an University of Posts & Telecommunications, Xi’an, China
Cong Zhang

Xi'an Key Laboratory of Advanced Control and Intelligent Process, School of Automation, Xi’an University of Posts & Telecommunications, Xi’an, China
Bin Zhang

School of Information Network Security, Xinjiang University of Political Science and Law, Tumushuke, Xinjiang, China; Department of Computer, City University of Hong Kong, Hongkong, China; School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, China

Contact Email

Download PDF

Sections

Plain Text BibTeX RIS

APA Style

Wu, L., Liu, P., Qu, J., Zhang, C., Zhang, B. (2024). Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. International Journal of Sensors and Sensor Networks, 12(1), 1-12. https://doi.org/10.11648/j.ijssn.20241201.11

Copy | Download

ACS Style

Wu, L.; Liu, P.; Qu, J.; Zhang, C.; Zhang, B. Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. Int. J. Sens. Sens. Netw. 2024, 12(1), 1-12. doi: 10.11648/j.ijssn.20241201.11

Copy | Download

AMA Style

Wu L, Liu P, Qu J, Zhang C, Zhang B. Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. Int J Sens Sens Netw. 2024;12(1):1-12. doi: 10.11648/j.ijssn.20241201.11

Copy | Download

@article{10.11648/j.ijssn.20241201.11,
  author = {Liangshun Wu and Peilin Liu and Junsuo Qu and Cong Zhang and Bin Zhang},
  title = {Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm},
  journal = {International Journal of Sensors and Sensor Networks},
  volume = {12},
  number = {1},
  pages = {1-12},
  doi = {10.11648/j.ijssn.20241201.11},
  url = {https://doi.org/10.11648/j.ijssn.20241201.11},
  eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijssn.20241201.11},
  abstract = {This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs.
},
 year = {2024}
}

Copy | Download

TY  - JOUR
T1  - Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm
AU  - Liangshun Wu
AU  - Peilin Liu
AU  - Junsuo Qu
AU  - Cong Zhang
AU  - Bin Zhang
Y1  - 2024/02/28
PY  - 2024
N1  - https://doi.org/10.11648/j.ijssn.20241201.11
DO  - 10.11648/j.ijssn.20241201.11
T2  - International Journal of Sensors and Sensor Networks
JF  - International Journal of Sensors and Sensor Networks
JO  - International Journal of Sensors and Sensor Networks
SP  - 1
EP  - 12
PB  - Science Publishing Group
SN  - 2329-1788
UR  - https://doi.org/10.11648/j.ijssn.20241201.11
AB  - This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs.

VL  - 12
IS  - 1
ER  -

Copy | Download