Translation of Foreign-Language Material on Algorithms

(Reward matrix R, row F fragment: 0 0 100.)

The minus sign in the table says that the row state has no action to go to the column state. For example, state A cannot go to state B (because there is no door connecting room A and room B, remember?).

In the previous sections of this tutorial, we modeled the environment and the reward system for our agent. This section describes the learning algorithm called Q-learning (which is a simplification of reinforcement learning).

We have modeled the environment's reward system as the matrix R. Now we need to put a similar matrix, named Q, into the brain of our agent; it represents the memory of what the agent has learned through many experiences. Each row of matrix Q represents a current state of the agent, and each column of matrix Q points to the action that leads to the next state.

In the beginning, we say that the agent knows nothing, so we set Q to the zero matrix. In this example, for simplicity of explanation, we assume the number of states is known (to be six). In the more general case, you can start with a zero matrix of a single cell; it is a simple task to add more columns and rows to the Q matrix whenever a new state is found.

The transition rule of this Q-learning is the very simple formula

Q(state, action) = R(state, action) + Gamma * Max[Q(next state, all actions)]

The formula above means that the entry of matrix Q (whose row represents the state and whose column represents the action) is equal to the corresponding entry of matrix R plus the learning parameter Gamma multiplied by the maximum value of Q over all actions in the next state.

Our virtual agent will learn through experience, without a teacher (this is called unsupervised learning). The agent will explore from state to state until it reaches the goal. We call each exploration an episode; in one episode the agent moves from an initial state until it reaches the goal state.
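To make the transition rule concrete, here is a minimal sketch of one possible learning loop in Python (the excerpt itself contains no code). The full reward matrix R, the value of the learning parameter Gamma (0.8 here), the random exploration policy, and the number of episodes are illustrative assumptions: the excerpt only states that a minus sign means there is no door, 0 marks a door with no reward, and 100 is the reward for reaching the goal room F.

```python
import numpy as np

GOAL = 5  # state F

# Illustrative reconstruction of the reward matrix R for the six-room example:
# -1 stands for the minus sign (no door), 0 is a door with no reward,
# and 100 rewards a transition into the goal state F.
R = np.array([
    [-1, -1, -1, -1,  0,  -1],   # A
    [-1, -1, -1,  0, -1, 100],   # B
    [-1, -1, -1,  0, -1,  -1],   # C
    [-1,  0,  0, -1,  0,  -1],   # D
    [ 0, -1, -1,  0, -1, 100],   # E
    [-1,  0, -1, -1,  0, 100],   # F
])

gamma = 0.8                         # learning parameter (assumed value)
Q = np.zeros_like(R, dtype=float)   # the agent starts knowing nothing

rng = np.random.default_rng(0)

for episode in range(500):
    state = rng.integers(0, 6)      # start each episode in a random room
    while state != GOAL:
        # valid actions from this state are the columns where a door exists
        actions = np.flatnonzero(R[state] >= 0)
        action = rng.choice(actions)        # explore: pick a random valid action
        next_state = action                 # the action index is the next room
        # transition rule: Q(s, a) = R(s, a) + Gamma * max over a' of Q(s', a')
        Q[state, action] = R[state, action] + gamma * Q[next_state].max()
        state = next_state

print(np.round(Q / Q.max() * 100))  # normalized learned Q matrix
```

Once Q has stabilized after enough episodes, the agent can act greedily: from any room, following the largest Q value in the current row leads it toward the goal room.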