混合交通中高速公路入口匝道合并协同驾驶决策研究

    Research on Cooperative Driving Decisions for Highway On-ramp Merging in Mixed Traffic

    • 摘要: 在智能网联汽车(Connected Autonomous Vehicle, CAV)与人类驾驶车辆(Human Driving Vehicle, HDV)共存的混合交通中,高速公路入口匝道合并问题具有挑战性。涉及不同类型车辆的道路争夺问题会对交通流产生影响,车辆可能在道路上争夺位置,包括合并、变道等行为,导致CAV难以准确预测和适应其行为,增加了合并的风险,导致交通效率下降,并引发交通拥堵。传统的强化学习算法在复杂环境中难以有效地搜索到优秀的策略,并容易陷入局部最优解,无法有效应对复杂的交通情况,导致合并决策不够精准。针对上述问题,提出了ESACD(Evolutionary Soft Actor-critic for Discrete Action Settings)算法,通过CAV协作适应HDV的策略以最大化交通吞吐量。首先,提出了基于排名选择的父代选择与交叉互换方法,对交互种群进行建模。其次,设计了基于多种群的弹性训练种群,提高CAV应对动态变化的交通流量的适应性。最后,提出了基于适应度评估的二次考核机制。通过在两种不同的交通密度下进行仿真实验,实验结果表明,与传统的演员评论家(Soft Actor-critic, SAC)算法相比,采用该算法能够更高效地完成车联网在入口匝道合并任务,综合提升率较为显著。这验证了该算法能够提升训练效率,扩大交通吞吐量。

       

      Abstract: In the mixed traffic environment where Connected Autonomous Vehicles (CAVs) and Human Driving Vehicles (HDV) coexist, the Highway On-Ramp Merging Problem presents challenges. The road contention issue involving different types of vehicles usually impacts traffic flow. Vehicles may contend for positions on the road, including merging, lane changing, and other behaviors, leading to the challenges of accurately predicting and adapting to their actions for CAVs. This increases the risk of merging, resulting in decreased traffic efficiency and traffic congestion. Traditional reinforcement learning algorithms have difficulty in effectively searching for optimal strategies in complex environments, and they are prone to getting stuck in local optima. They are unable to effectively deal with complex traffic situations, leading to imprecise merging decisions. To address these challenges, the Evolutionary Soft Actor-Critic for Discrete Action Settings (ESACD) algorithm is proposed. It maximizes the traffic throughput by adaptively coordinating CAVs to HDV strategies. Firstly, a Rank Selection-based Parent Selection and Crossover Method is introduced to model the interaction population. Secondly, a Multiple Populations with Elastic Training method is designed to enhance CAV adaptability to the changes of the dynamic traffic flow. Finally, a Fitness Evaluation-based Secondary Assessment Mechanism is proposed. Simulation experiments conducted under two different traffic densities demonstrate that the proposed algorithm more efficiently completes the merging task at highway on-ramps for connected vehicles with a significant overall improvement rate when compared with the traditional Soft Actor-Critic (SAC) algorithm. This validates the training efficiency of the proposed algorithm with expanding the traffic throughput.

       

    /

    返回文章
    返回