دانلود کتاب Zero-Sum Discrete-Time Markov Games With Unknown Disturbance Distribution – Discounted And Average Criteria, 2020

آگوست 15, 2021

نام کتاب: Zero-Sum Discrete-Time Markov Games With Unknown Disturbance Distribution – Discounted And Average Criteria

نویسنده: J. Adolfo Minjárez-Sosa

ویرایش: ۱

سال انتشار: ۲۰۲۰

کد ISBN کتاب: ۳۰۳۰۳۵۷۱۹۸, ۹۷۸۳۰۳۰۳۵۷۱۹۱,

فرمت: PDF

تعداد صفحه: ۱۲۰

حجم کتاب: ۱ مگابایت

کیفیت کتاب: OCR

انتشارات: Springer

Description About Book Zero-Sum Discrete-Time Markov Games With Unknown Disturbance Distribution – Discounted And Average Criteria From Amazon

This SpringerBrief deals with a class of discrete-time zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs, under discounted and average criteria, whose state process evolves according to a stochastic difference equation. The corresponding disturbance process is an observable sequence of independent and identically distributed random variables with unknown distribution for both players. Unlike the standard case, the game is played over an infinite horizon evolving as follows. At each stage, once the players have observed the state of the game, and before choosing the actions, players 1 and 2 implement a statistical estimation process to obtain estimates of the unknown distribution. Then, independently, the players adapt their decisions to such estimators to select their actions and construct their strategies. This book presents a systematic analysis on recent developments in this kind of games. Specifically, the theoretical foundations on the procedures combining statistical estimation and control techniques for the construction of strategies of the players are introduced, with illustrative examples. In this sense, the book is an essential reference for theoretical and applied researchers in the fields of stochastic control and game theory, and their applications.

درباره کتاب Zero-Sum Discrete-Time Markov Games With Unknown Disturbance Distribution – Discounted And Average Criteria ترجمه شده از گوگل

این SpringerBrief به دسته ای از بازیهای مارکف با زمان صفر با زمان مجزا با فضاهای حالت و عملکرد بورل و احتمالاً بازدهی نامحدود ، با معیارهای تخفیف داده شده و متوسط می پردازد ، که فرایند حالت آنها بر اساس یک معادله تفاوت تصادفی تکامل می یابد. فرایند اختلال مربوطه یک دنباله قابل مشاهده از متغیرهای تصادفی مستقل و توزیع شده با توزیع ناشناخته برای هر دو بازیکن است. بر خلاف حالت استاندارد ، بازی در افق بی نهایت انجام می شود که به شرح زیر در حال تکامل است. در هر مرحله ، هنگامی که بازیکنان وضعیت بازی را مشاهده کردند ، و قبل از انتخاب اقدامات ، بازیکنان ۱ و ۲ یک فرایند تخمین آماری را برای بدست آوردن برآورد توزیع ناشناخته اجرا می کنند. سپس ، به طور مستقل ، بازیکنان تصمیمات خود را با چنین برآوردگرهایی تطبیق می دهند تا اقدامات خود را انتخاب کرده و استراتژی های خود را بسازند. این کتاب تجزیه و تحلیل سیستماتیک در مورد پیشرفت های اخیر در این نوع بازی ها ارائه می دهد. به طور خاص ، مبانی نظری رویه های ترکیبی از تکنیک های برآورد آماری و کنترل برای ساخت استراتژی بازیکنان ، با مثالهای گویا معرفی شده است. از این نظر ، این کتاب یک مرجع اساسی برای محققان نظری و کاربردی در زمینه های کنترل تصادفی و نظریه بازی و کاربردهای آنها است.

[box type=”info”] جهت دسترسی به توضیحات این کتاب در Amazon اینجا کلیک کنید.

در صورت خراب بودن لینک کتاب، در قسمت نظرات همین مطلب گزارش دهید.