Suggestion of some courses in sequential decision making

I am studying sequential decision making and would like to know whether there are any recorded, publicly available courses covering dynamic programming (DP), reinforcement learning (RL), bandit problems, approximate DP/RL, and online optimization.

Thanks

edited Jul 1 at 11:36 by Marcus Ritt

asked Jun 30 at 8:31 by Amin Sh

reference-request online-resources reinforcement-learning sequential-decision-making dynamic-programming

2 Answers

There are a few courses on Coursera that offer such learning materials.

Greedy Algorithms, Minimum Spanning Trees, and Dynamic Programming (Intermediate)

The primary topics in this part of the specialization are: greedy algorithms (scheduling, minimum spanning trees, clustering, Huffman codes) and dynamic programming (knapsack, sequence alignment, optimal search trees).

If you want to go directly to dynamic programming then you can skip to weeks 3 and 4 of the syllabus.
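
As a taste of the dynamic programming material listed for that course, here is a minimal sketch of the 0/1 knapsack recurrence. It is not taken from the course itself; the function name `knapsack_max_value` and the sample items in the note below are made up for illustration, and weights are assumed to be non-negative integers.

```python
from typing import Sequence


def knapsack_max_value(values: Sequence[int], weights: Sequence[int], capacity: int) -> int:
    """Best total value of a subset of items whose total weight fits in `capacity`."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    # best[c] = best value achievable with total weight at most c,
    # using only the items processed so far.
    best = [0] * (capacity + 1)
    for value, weight in zip(values, weights):
        # Walk the capacities downwards so each item is used at most once.
        for c in range(capacity, weight - 1, -1):
            best[c] = max(best[c], best[c - weight] + value)
    return best[capacity]
```

For example, `knapsack_max_value([60, 100, 120], [10, 20, 30], 50)` picks the second and third items. The one-dimensional table keeps memory proportional to the capacity rather than to the number of items times the capacity.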

Practical Reinforcement Learning (Advanced)

Here you will find out about:

foundations of RL methods: value/policy iteration, q-learning, policy gradient, etc. - with math & batteries included

using deep neural networks for RL tasks - also known as "the hype train"

state of the art RL algorithms - and how to apply duct tape to them for practical problems.

and, of course, teaching your neural network to play games - because that's what everyone thinks RL is about. We'll also use it for seq2seq and contextual bandits.
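
To make the first item on that list a bit more concrete, below is a rough sketch of value iteration on a tiny tabular MDP. It is not from the course: the two-state MDP, the layout of `transitions`, and the function name `value_iteration` are all assumptions chosen for illustration.

```python
from typing import Dict, List, Tuple

# transitions[state][action] = list of (probability, next_state, reward)
Transitions = Dict[str, Dict[str, List[Tuple[float, str, float]]]]


def value_iteration(
    transitions: Transitions, gamma: float = 0.9, tol: float = 1e-10
) -> Dict[str, float]:
    """Repeat the Bellman optimality backup until no value changes by tol or more."""
    values = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for s, actions in transitions.items():
            # Value of the best action: expected reward plus discounted next-state value.
            new_v = max(
                sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(new_v - values[s]))
            values[s] = new_v  # in-place update, reused within the same sweep
        if delta < tol:
            return values


# Hypothetical example: in A, "stay" pays 1 and "go" moves to B for free;
# in B, the only action "stay" pays 2.
toy_mdp: Transitions = {
    "A": {"stay": [(1.0, "A", 1.0)], "go": [(1.0, "B", 0.0)]},
    "B": {"stay": [(1.0, "B", 2.0)]},
}
```

With the default discount of 0.9, `value_iteration(toy_mdp)` should return values close to 20 for B (2 / (1 - 0.9)) and 18 for A (0.9 × 20, since moving to B beats staying in A).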

answered Jun 30 at 10:06 by TheSimpliFire

-1

AFAIK, there are some examples on YouTube. For instance, this link.

answered Jul 2 at 8:18 by abbas omidi

This is a link to a search for the main words in the title. While this could be a decent suggestion, I don't think it is a proper answer. You can improve your answer by looking at what the link finds and providing a brief summary of the material, similar to the other answer.
– Discrete lizard, Jul 3 at 8:52


