Show simple item record

Real-time Distributed Optimization of Traffic Signal Timing

dc.contributor.authorYin, Yafeng
dc.contributor.authorShen, Siqian
dc.contributor.authorFeng, Yiheng
dc.contributor.authorFei, Xinyu
dc.contributor.authorYu, Xian
dc.contributor.authorWang, Xingmin
dc.contributor.authorMi, Tian
dc.date.accessioned2023-02-01T18:17:04Z
dc.date.available2023-02-01T18:17:04Z
dc.date.issued2023-02-01
dc.identifier.citationSuggested APA Format Citation: Yin, Yafeng, Shen, Siqian, Feng, Yiheng, Fei, Xinyu, Yu, Xian, Wang Xingmin, & Mi, Tian, (2021) "Real-time Distributed Optimization of Traffic Signal Timing". CCAT Project No. 13, Center for Connected and Automated Transportation, University of Michigan.en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/175728en
dc.description.abstractLeveraging recent advancements in distributed optimization and reinforcement learning, and the growing connectivity and computational capability of vehicles and infrastructure, we propose to advance real-time adaptive signal control via distributed control and optimization. This report consists of three parts. Part 1 develops distributed algorithms for solving a traffic signal timing optimization problem, which is formulated as a mixed-integer programming model. Specifically, the alternating direction method of multipliers (ADMM) is employed, and a two-stage stochastic cell transmission model (CTM) that considers the uncertainty of traffic demand and vehicle turning ratios is considered. Part 2 proposes a framework that utilizes reinforcement learning to optimize a max pressure controller considering the phase switching loss. The max pressure control is modified by introducing a switching curve, and the proposed control method is proved throughput-optimal in a store-and-forward network. Then the theoretical control policy is extended by using a distributed approximation and position-weighted pressure so that the policy-gradient reinforcement learning algorithms can be utilized to optimize the parameters in the policy network including the switching curve and the weigh curve. Part 3 applies reinforcement learning to traffic signal control in a multi-agent scheme, considering the data availability and implementability. The information extracted from traffic cameras is used to define the state of the agents; the action design is aligned with the NEMA dual-ring convention and bounded by a safety constraint, and the coordination is achieved by a shared reward structure among agents.en_US
dc.description.sponsorshipU.S. Department of Transportation Office of the Assistant Secretary for Research and Technology 1200 New Jersey Avenue, SE Washington, DC 20590 (OST-R)en_US
dc.language.isoen_USen_US
dc.relation.ispartofseriesCCAT Report No. 13en_US
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjecttraffic signal controlen_US
dc.subjectdistributed optimizationen_US
dc.subjectmaximum pressure controlen_US
dc.subjectreinforcement learningen_US
dc.titleReal-time Distributed Optimization of Traffic Signal Timingen_US
dc.typeTechnical Reporten_US
dc.subject.hlbsecondlevelCivil and Environmental Engineering
dc.subject.hlbtoplevelEngineering
dc.contributor.affiliationumCivil and Environmental Engineering, Department ofen_US
dc.contributor.affiliationotherPurdue Universityen_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/175728/1/Real-time Distributed Optimization of Traffic Signal Timing Final Report.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/6942
dc.identifier.orcid0000-0001-7010-8664en_US
dc.identifier.orcid0000-0003-1059-5303en_US
dc.identifier.orcid0000-0003-0435-2786en_US
dc.identifier.orcid0000-0002-3780-3216en_US
dc.identifier.orcid0000-0003-3117-5463en_US
dc.identifier.orcid0000-0002-2854-163Xen_US
dc.identifier.orcid0000-0001-5656-3222en_US
dc.description.filedescriptionDescription of Real-time Distributed Optimization of Traffic Signal Timing Final Report.pdf : Final Report
dc.description.depositorSELFen_US
dc.identifier.name-orcidYu, Xian; 0000-0003-1059-5303en_US
dc.identifier.name-orcidWang, Xingmin; 0000-0003-0435-2786en_US
dc.identifier.name-orcidMi, Tian; 0000-0002-3780-3216en_US
dc.identifier.name-orcidYin, Yafeng; 0000-0003-3117-5463en_US
dc.identifier.name-orcidShen, Siqian; 0000-0002-2854-163Xen_US
dc.identifier.name-orcidFeng, Yiheng; 0000-0001-5656-3222en_US
dc.identifier.name-orcidFei, Xinyu; 0000-0001-7010-8664en_US
dc.working.doi10.7302/6942en_US
dc.owningcollnameCivil & Environmental Engineering (CEE)


Files in this item

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 International
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.