A Task-oriented Dialogue Policy Learning Method of Improved Discriminative Deep Dyna-Q
Dai Bin, Zeng Bi, Wei Peng-fei, Huang Yong-jian
Journal of Guangdong University of Technology . 2023, (04): 9 -17,23 .  DOI: 10.12052/gdutxb.220122