Well-designed traffic signal control allocates right-of-way efficiently, reducing vehicle waiting time, improving intersection throughput, and making better use of road resources. With the continued development of reinforcement learning and artificial intelligence, traffic signal control algorithms have steadily improved. However, most previous reinforcement-learning studies of signal control optimization have focused on improving the agent's learning process and paid little attention to the signal control scheme supplied as input. To improve intersection efficiency more effectively, this study proposes a Deep Q Network (DQN) signal control optimization algorithm that incorporates expert experience. The algorithm first designs a fixed-time signal timing plan and then trains with DQN in the microscopic traffic simulation environment SUMO to obtain the optimal signal control execution scheme for the intersection. Because the fixed-time plan is calculated from the intersection's road conditions, flow directions, and actual traffic volumes, the signal control scheme given as input is more reasonable. Experiments at the intersection of Huasui Road and Huacheng Avenue gave the following results: compared with the signal control scheme currently in use, both the DQN algorithm and the expert-experience DQN algorithm increased the average speed at the intersection by 7.9%; the DQN algorithm reduced the average number of waiting vehicles by 23.1%, and the expert-experience DQN algorithm reduced it by 69.2%. These results show that both optimization algorithms can effectively improve intersection traffic efficiency, with the expert-experience DQN signal control optimization algorithm performing best of all the algorithms compared.
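The idea of feeding an expert-designed fixed-time plan into the learner can be illustrated with a minimal sketch. The abstract does not say how the fixed-time plan is computed or how it enters the DQN, so everything here is an assumption: Webster's classical cycle-length formula stands in for the expert calculation, and the DQN's discrete action set is assumed to consist of green-time vectors close to that plan. The names `webster_plan` and `candidate_actions`, the flow ratios, the lost time, and the adjustment step are all hypothetical.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class FixedTimePlan:
    cycle: float   # cycle length in seconds
    greens: tuple  # effective green time per phase, in seconds


def webster_plan(flow_ratios, lost_time):
    """Fixed-time plan from Webster's formula: C = (1.5 * L + 5) / (1 - Y).

    flow_ratios: critical flow ratio (flow / saturation flow) of each phase.
    lost_time:   total lost time L per cycle, in seconds.
    Assumption: a stand-in for the paper's expert timing calculation.
    """
    total = sum(flow_ratios)
    if not 0 < total < 1:
        raise ValueError("sum of critical flow ratios must lie in (0, 1)")
    cycle = (1.5 * lost_time + 5.0) / (1.0 - total)
    effective_green = cycle - lost_time
    # Split the effective green among phases in proportion to their flow ratios.
    greens = tuple(effective_green * y / total for y in flow_ratios)
    return FixedTimePlan(cycle, greens)


def candidate_actions(plan, deltas=(-5.0, 0.0, 5.0), min_green=10.0):
    """Enumerate green-time vectors near the expert plan.

    Assumption: each vector is one discrete action a DQN agent could choose;
    vectors that push any phase below min_green are discarded.
    """
    actions = []
    for shift in product(deltas, repeat=len(plan.greens)):
        greens = tuple(g + d for g, d in zip(plan.greens, shift))
        if all(g >= min_green for g in greens):
            actions.append(greens)
    return actions


# Hypothetical four-phase intersection; these numbers are not from the paper.
plan = webster_plan(flow_ratios=(0.25, 0.15, 0.20, 0.10), lost_time=16.0)
actions = candidate_actions(plan)
print(f"cycle = {plan.cycle:.1f} s, {len(actions)} candidate actions")
```

Restricting the action set to the neighbourhood of a plan derived from measured flows is one plausible reading of "more reasonable input": the agent explores only timings that are already feasible for the intersection, rather than arbitrary phase durations. In a full setup, each chosen vector would be applied to the simulated signal in SUMO and the resulting waiting-vehicle count would feed the reward.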