Suggestion of some courses in sequential decision makingOpen source Markov decision process solversReference request: how to model nonlinear regression?What is the connection of Operations Research and Reinforcement Learning?Single reference for Mixed Integer Programming formulations to linearize, handle logical constraints and disjunctive constraints, do Big M, etc?Recommended books/materials for practical applications of Operations Research in industryReal-world examples of supply chain contracts?

I make billions (#6)

Is it better in terms of durability to remove card+battery or to connect to charger/computer via USB-C?

Optimization models for portfolio optimization

Is it ok for parents to kiss and romance with each other while their 2- to 8-year-old child watches?

Non-Chromatic Orchestral Instruments?

When I press the space bar it deletes the letters in front of it

How do you move up one folder in Finder?

How does one acquire an undead eyeball encased in a gem?

Is it stylistically sound to use onomatopoeic words?

Did Rabbi Akiva accept arguments from ignorance?

My previous employer committed a severe violation of the law and is also being sued by me. How do I explain the situation to future employers?

Is there a strong legal guarantee that the U.S. can give to another country that it won't attack them?

How to properly translate "kusura bakmasınlar" ("the fault should not be looked at") to Russian?

Who buys a weak currency?

What are the consequences for a developed nation to not accept any refugees?

Why is a mixture of two normally distributed variables only bimodal if their means differ by at least two times the common standard deviation?

What happens to unproductive professors?

Why did Dumbledore ignore this line?

Correct notation for guitar fingerstyle

Why does Trump want a citizenship question on the census?

Found and corrected a mistake on someone's else paper -- praxis?

Conditions for Roots of a quadratic equation at infinity

How should I ask for a "pint" in countries that use metric?

Red token deck mass token destruction enchantment protection mtg



Suggestion of some courses in sequential decision making


Open source Markov decision process solversReference request: how to model nonlinear regression?What is the connection of Operations Research and Reinforcement Learning?Single reference for Mixed Integer Programming formulations to linearize, handle logical constraints and disjunctive constraints, do Big M, etc?Recommended books/materials for practical applications of Operations Research in industryReal-world examples of supply chain contracts?













10












$begingroup$


I am studying about sequential decision making and I am willing to know if there is any course which is recorded and is publically available covering topics in dynamic programming (DP), reinforcement learning (RL), bandit problem, approximate DPRL, online optimization?



Thanks










share|improve this question











$endgroup$
















    10












    $begingroup$


    I am studying about sequential decision making and I am willing to know if there is any course which is recorded and is publically available covering topics in dynamic programming (DP), reinforcement learning (RL), bandit problem, approximate DPRL, online optimization?



    Thanks










    share|improve this question











    $endgroup$














      10












      10








      10





      $begingroup$


      I am studying about sequential decision making and I am willing to know if there is any course which is recorded and is publically available covering topics in dynamic programming (DP), reinforcement learning (RL), bandit problem, approximate DPRL, online optimization?



      Thanks










      share|improve this question











      $endgroup$




      I am studying about sequential decision making and I am willing to know if there is any course which is recorded and is publically available covering topics in dynamic programming (DP), reinforcement learning (RL), bandit problem, approximate DPRL, online optimization?



      Thanks







      reference-request online-resources reinforcement-learning sequential-decision-making dynamic-programming






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Jul 1 at 11:36









      Marcus Ritt

      2,0565 silver badges29 bronze badges




      2,0565 silver badges29 bronze badges










      asked Jun 30 at 8:31









      Amin ShAmin Sh

      3581 silver badge7 bronze badges




      3581 silver badge7 bronze badges




















          2 Answers
          2






          active

          oldest

          votes


















          9












          $begingroup$

          There are a few courses on Coursera that offer such learning materials.




          • Greedy Algorithms, Minimum Spanning Trees, and Dynamic Programming (Intermediate)


          The primary topics in this part of the specialization are: greedy algorithms (scheduling, minimum spanning trees, clustering, Huffman codes) and dynamic programming (knapsack, sequence alignment, optimal search trees).




          If you want to go directly to dynamic programming then you can skip to weeks 3 and 4 of the syllabus.




          • Practical Reinforcement Learning (Advanced)


          Here you will find out about:



          • foundations of RL methods: value/policy iteration, q-learning, policy gradient, etc. - with math & batteries included


          • using deep neural networks for RL tasks - also known as "the hype train"


          • state of the art RL algorithms - and how to apply duct tape to them for practical problems.


          • and, of course, teaching your neural network to play games - because that's what everyone thinks RL is about. We'll also use it for seq2seq and contextual bandits.







          share|improve this answer









          $endgroup$




















            -1












            $begingroup$

            AFAIK, There are some examples on the YouTube host. For instance, this link.






            share|improve this answer









            $endgroup$












            • $begingroup$
              This is a link to a search term of the main words in the title. While this could be a decent suggestion, I don't think this is a proper answer. You can improve your answer by seeing what this link finds you and providing a brief summary of the material you found, similar to the other answer.
              $endgroup$
              – Discrete lizard
              Jul 3 at 8:52













            Your Answer








            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "700"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            noCode: true, onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );













            draft saved

            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2for.stackexchange.com%2fquestions%2f831%2fsuggestion-of-some-courses-in-sequential-decision-making%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            2 Answers
            2






            active

            oldest

            votes








            2 Answers
            2






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            9












            $begingroup$

            There are a few courses on Coursera that offer such learning materials.




            • Greedy Algorithms, Minimum Spanning Trees, and Dynamic Programming (Intermediate)


            The primary topics in this part of the specialization are: greedy algorithms (scheduling, minimum spanning trees, clustering, Huffman codes) and dynamic programming (knapsack, sequence alignment, optimal search trees).




            If you want to go directly to dynamic programming then you can skip to weeks 3 and 4 of the syllabus.




            • Practical Reinforcement Learning (Advanced)


            Here you will find out about:



            • foundations of RL methods: value/policy iteration, q-learning, policy gradient, etc. - with math & batteries included


            • using deep neural networks for RL tasks - also known as "the hype train"


            • state of the art RL algorithms - and how to apply duct tape to them for practical problems.


            • and, of course, teaching your neural network to play games - because that's what everyone thinks RL is about. We'll also use it for seq2seq and contextual bandits.







            share|improve this answer









            $endgroup$

















              9












              $begingroup$

              There are a few courses on Coursera that offer such learning materials.




              • Greedy Algorithms, Minimum Spanning Trees, and Dynamic Programming (Intermediate)


              The primary topics in this part of the specialization are: greedy algorithms (scheduling, minimum spanning trees, clustering, Huffman codes) and dynamic programming (knapsack, sequence alignment, optimal search trees).




              If you want to go directly to dynamic programming then you can skip to weeks 3 and 4 of the syllabus.




              • Practical Reinforcement Learning (Advanced)


              Here you will find out about:



              • foundations of RL methods: value/policy iteration, q-learning, policy gradient, etc. - with math & batteries included


              • using deep neural networks for RL tasks - also known as "the hype train"


              • state of the art RL algorithms - and how to apply duct tape to them for practical problems.


              • and, of course, teaching your neural network to play games - because that's what everyone thinks RL is about. We'll also use it for seq2seq and contextual bandits.







              share|improve this answer









              $endgroup$















                9












                9








                9





                $begingroup$

                There are a few courses on Coursera that offer such learning materials.




                • Greedy Algorithms, Minimum Spanning Trees, and Dynamic Programming (Intermediate)


                The primary topics in this part of the specialization are: greedy algorithms (scheduling, minimum spanning trees, clustering, Huffman codes) and dynamic programming (knapsack, sequence alignment, optimal search trees).




                If you want to go directly to dynamic programming then you can skip to weeks 3 and 4 of the syllabus.




                • Practical Reinforcement Learning (Advanced)


                Here you will find out about:



                • foundations of RL methods: value/policy iteration, q-learning, policy gradient, etc. - with math & batteries included


                • using deep neural networks for RL tasks - also known as "the hype train"


                • state of the art RL algorithms - and how to apply duct tape to them for practical problems.


                • and, of course, teaching your neural network to play games - because that's what everyone thinks RL is about. We'll also use it for seq2seq and contextual bandits.







                share|improve this answer









                $endgroup$



                There are a few courses on Coursera that offer such learning materials.




                • Greedy Algorithms, Minimum Spanning Trees, and Dynamic Programming (Intermediate)


                The primary topics in this part of the specialization are: greedy algorithms (scheduling, minimum spanning trees, clustering, Huffman codes) and dynamic programming (knapsack, sequence alignment, optimal search trees).




                If you want to go directly to dynamic programming then you can skip to weeks 3 and 4 of the syllabus.




                • Practical Reinforcement Learning (Advanced)


                Here you will find out about:



                • foundations of RL methods: value/policy iteration, q-learning, policy gradient, etc. - with math & batteries included


                • using deep neural networks for RL tasks - also known as "the hype train"


                • state of the art RL algorithms - and how to apply duct tape to them for practical problems.


                • and, of course, teaching your neural network to play games - because that's what everyone thinks RL is about. We'll also use it for seq2seq and contextual bandits.








                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Jun 30 at 10:06









                TheSimpliFireTheSimpliFire

                8893 silver badges25 bronze badges




                8893 silver badges25 bronze badges





















                    -1












                    $begingroup$

                    AFAIK, There are some examples on the YouTube host. For instance, this link.






                    share|improve this answer









                    $endgroup$












                    • $begingroup$
                      This is a link to a search term of the main words in the title. While this could be a decent suggestion, I don't think this is a proper answer. You can improve your answer by seeing what this link finds you and providing a brief summary of the material you found, similar to the other answer.
                      $endgroup$
                      – Discrete lizard
                      Jul 3 at 8:52















                    -1












                    $begingroup$

                    AFAIK, There are some examples on the YouTube host. For instance, this link.






                    share|improve this answer









                    $endgroup$












                    • $begingroup$
                      This is a link to a search term of the main words in the title. While this could be a decent suggestion, I don't think this is a proper answer. You can improve your answer by seeing what this link finds you and providing a brief summary of the material you found, similar to the other answer.
                      $endgroup$
                      – Discrete lizard
                      Jul 3 at 8:52













                    -1












                    -1








                    -1





                    $begingroup$

                    AFAIK, There are some examples on the YouTube host. For instance, this link.






                    share|improve this answer









                    $endgroup$



                    AFAIK, There are some examples on the YouTube host. For instance, this link.







                    share|improve this answer












                    share|improve this answer



                    share|improve this answer










                    answered Jul 2 at 8:18









                    abbas omidiabbas omidi

                    3235 bronze badges




                    3235 bronze badges











                    • $begingroup$
                      This is a link to a search term of the main words in the title. While this could be a decent suggestion, I don't think this is a proper answer. You can improve your answer by seeing what this link finds you and providing a brief summary of the material you found, similar to the other answer.
                      $endgroup$
                      – Discrete lizard
                      Jul 3 at 8:52
















                    • $begingroup$
                      This is a link to a search term of the main words in the title. While this could be a decent suggestion, I don't think this is a proper answer. You can improve your answer by seeing what this link finds you and providing a brief summary of the material you found, similar to the other answer.
                      $endgroup$
                      – Discrete lizard
                      Jul 3 at 8:52















                    $begingroup$
                    This is a link to a search term of the main words in the title. While this could be a decent suggestion, I don't think this is a proper answer. You can improve your answer by seeing what this link finds you and providing a brief summary of the material you found, similar to the other answer.
                    $endgroup$
                    – Discrete lizard
                    Jul 3 at 8:52




                    $begingroup$
                    This is a link to a search term of the main words in the title. While this could be a decent suggestion, I don't think this is a proper answer. You can improve your answer by seeing what this link finds you and providing a brief summary of the material you found, similar to the other answer.
                    $endgroup$
                    – Discrete lizard
                    Jul 3 at 8:52

















                    draft saved

                    draft discarded
















































                    Thanks for contributing an answer to Operations Research Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    Use MathJax to format equations. MathJax reference.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2for.stackexchange.com%2fquestions%2f831%2fsuggestion-of-some-courses-in-sequential-decision-making%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Category:9 (number) SubcategoriesMedia in category "9 (number)"Navigation menuUpload mediaGND ID: 4485639-8Library of Congress authority ID: sh85091979ReasonatorScholiaStatistics

                    Circuit construction for execution of conditional statements using least significant bitHow are two different registers being used as “control”?How exactly is the stated composite state of the two registers being produced using the $R_zz$ controlled rotations?Efficiently performing controlled rotations in HHLWould this quantum algorithm implementation work?How to prepare a superposed states of odd integers from $1$ to $sqrtN$?Why is this implementation of the order finding algorithm not working?Circuit construction for Hamiltonian simulationHow can I invert the least significant bit of a certain term of a superposed state?Implementing an oracleImplementing a controlled sum operation

                    Magento 2 “No Payment Methods” in Admin New OrderHow to integrate Paypal Express Checkout with the Magento APIMagento 1.5 - Sales > Order > edit order and shipping methods disappearAuto Invoice Check/Money Order Payment methodAdd more simple payment methods?Shipping methods not showingWhat should I do to change payment methods if changing the configuration has no effects?1.9 - No Payment Methods showing upMy Payment Methods not Showing for downloadable/virtual product when checkout?Magento2 API to access internal payment methodHow to call an existing payment methods in the registration form?