Q-learning by the nth step state and multi-agent negotiation in unknown environment

Job, Josip; Jović, Franjo; Livada, Časlav

Tehnički vjesnik, Vol. 19 No. 3, 2012.

Izvorni znanstveni članak

Q-learning by the nth step state and multi-agent negotiation in unknown environment

Josip Job ; Faculty of Electrical Engineering, J. J. Strossmayer University of Osijek, Cara Hadrijana bb, 31000 Osijek, Croatia
Franjo Jović ; Faculty of Electrical Engineering, J. J. Strossmayer University of Osijek, Cara Hadrijana bb, 31000 Osijek, Croatia
Časlav Livada ; Faculty of Electrical Engineering, J. J. Strossmayer University of Osijek, Cara Hadrijana bb, 31000 Osijek, Croatia

Puni tekst: hrvatski pdf 608 Kb

str. 529-534

preuzimanja: 859

citiraj

APA 6th Edition

Job, J., Jović, F. i Livada, Č. (2012). Q-learning by the nth step state and multi-agent negotiation in unknown environment. Tehnički vjesnik, 19 (3), 529-534. Preuzeto s https://hrcak.srce.hr/86725

MLA 8th Edition

Job, Josip, et al. "Q-learning by the nth step state and multi-agent negotiation in unknown environment." Tehnički vjesnik, vol. 19, br. 3, 2012, str. 529-534. https://hrcak.srce.hr/86725. Citirano 19.04.2024.

Chicago 17th Edition

Job, Josip, Franjo Jović i Časlav Livada. "Q-learning by the nth step state and multi-agent negotiation in unknown environment." Tehnički vjesnik 19, br. 3 (2012): 529-534. https://hrcak.srce.hr/86725

Harvard

Job, J., Jović, F., i Livada, Č. (2012). 'Q-learning by the nth step state and multi-agent negotiation in unknown environment', Tehnički vjesnik, 19(3), str. 529-534. Preuzeto s: https://hrcak.srce.hr/86725 (Datum pristupa: 19.04.2024.)

Vancouver

Job J, Jović F, Livada Č. Q-learning by the nth step state and multi-agent negotiation in unknown environment. Tehnički vjesnik [Internet]. 2012 [pristupljeno 19.04.2024.];19(3):529-534. Dostupno na: https://hrcak.srce.hr/86725

IEEE

J. Job, F. Jović i Č. Livada, "Q-learning by the nth step state and multi-agent negotiation in unknown environment", Tehnički vjesnik, vol.19, br. 3, str. 529-534, 2012. [Online]. Dostupno na: https://hrcak.srce.hr/86725. [Citirano: 19.04.2024.]

Puni tekst: engleski pdf 608 Kb

str. 529-534

preuzimanja: 327

citiraj

APA 6th Edition

MLA 8th Edition

Chicago 17th Edition

Harvard

Vancouver

IEEE

Sažetak

This work will show a new procedure of Q-learning in which the agent’s decision, regarding the next step, is not based on the optimal action at that moment but on the usefulness of a future state. A near agent communication has been implemented so that the agents signal each other their future actions which contribute to a better choice of actions for each of the agents. The new method is named Q-learning by the nth step and multi-agent negotiation. The results of the testing of this algorithm are compared with the basic QL algorithm which is also graphically demonstrated and the advantages of the new algorithm are listed too. An average of 40 % collision decrease is obtained during learning procedure.

Ključne riječi

agent; learning from reward and punishment; q-learning; reinforcement learning

Hrčak ID:

86725

URI

https://hrcak.srce.hr/86725

Datum izdavanja:

19.9.2012.

Podaci na drugim jezicima: hrvatski

Posjeta: 2.240 *

Prijava i registracija

Tehnički vjesnik, Vol. 19 No. 3, 2012.

Sažetak

Ključne riječi

Hrčak ID:

URI

Datum izdavanja: