3. Consider a home energy storage (battery) system that can store up to 2 units of energy. At every time step, there is a demand for energy in the home which is drawn from 0, 1, 2 units with equal probability independent of the demands in the previous time steps. At every point in time, you have to satisfy the demand either by discharging the needed energy from the battery or purchasing power from the grid (or a combination of the two). You could also choose to purchase power from the grid to charge your battery. The grid energy price is either H(igh) or L(ow) according to a Markov chain (Price moves from H to L with probability p, and from L to H with probability q). (a) Model the decision making as an infinite horizon MDP where the objective is to minimize the discounted cost of energy purchased over an infinite horizon. (b) Write down a policy 7 of your choosing. Perform two steps of the operator T, for your policy, followed by one step of T,.

Microeconomic Theory
12th Edition
ISBN:9781337517942
Author:NICHOLSON
Publisher:NICHOLSON
Chapter17: Capital And Time
Section: Chapter Questions
Problem 17.8P
icon
Related questions
Question
3. Consider a home energy storage (battery) system that can store up to 2 units of energy. At every time step, there
is a demand for energy in the home which is drawn from 0,1,2 units with equal probability independent of the
demands in the previous time steps. At every point in time, you have to satisfy the demand either by discharging
the needed energy from the battery or purchasing power from the grid (or a combination of the two). You could
also choose to purchase power from the grid to charge your battery. The grid energy price is either H(igh) or L(ow)
according to a Markov chain (Price moves from H to L with probability p, and from L to H with probability q).
(a) Model the decision making as an infinite horizon MDP where the objective is to minimize the discounted cost
of energy purchased over an infinite horizon.
(b) Write down a policy a of your choosing. Perform two steps of the operator T, for your policy, followed by one
step of T..
Transcribed Image Text:3. Consider a home energy storage (battery) system that can store up to 2 units of energy. At every time step, there is a demand for energy in the home which is drawn from 0,1,2 units with equal probability independent of the demands in the previous time steps. At every point in time, you have to satisfy the demand either by discharging the needed energy from the battery or purchasing power from the grid (or a combination of the two). You could also choose to purchase power from the grid to charge your battery. The grid energy price is either H(igh) or L(ow) according to a Markov chain (Price moves from H to L with probability p, and from L to H with probability q). (a) Model the decision making as an infinite horizon MDP where the objective is to minimize the discounted cost of energy purchased over an infinite horizon. (b) Write down a policy a of your choosing. Perform two steps of the operator T, for your policy, followed by one step of T..
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 2 steps with 2 images

Blurred answer
Knowledge Booster
Risk Aversion
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, economics and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
Microeconomic Theory
Microeconomic Theory
Economics
ISBN:
9781337517942
Author:
NICHOLSON
Publisher:
Cengage