Suppose there is a state S with two transitions under action A, but both resulting states are S'. The tricky part is that the two rewards are different. In this case, how should I construct the probability and reward matrices?
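One common convention (not stated in the question, so treat $p_1, p_2, r_1, r_2$ below as hypothetical labels for the two transitions' probabilities and rewards) is to merge the duplicate $(S, A, S')$ entries: the probability matrix gets the summed probability, and the reward matrix gets the probability-weighted average reward,

$$P(S' \mid S, A) = p_1 + p_2, \qquad R(S, A, S') = \frac{p_1 r_1 + p_2 r_2}{p_1 + p_2}.$$

This keeps the expected immediate reward of the pair $(S, A)$ unchanged.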
Say we've previously used a neural network or some other classifier C with $N$ training samples $I := \{I_1, \dots, I_N\}$ (which have a sequence or context, but it is ignored by C), belonging to $K$ classes. Assume, for some reason (probably some problem with training or with how the classes were declared), C is confused and doesn't perform well. The way we assign a class to each test sample $I$ using C is: $\mathrm{class}(I) := \arg\max_{1 \leq j \leq K} p_j(I)$, where $p_j(I)$ is the …
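As a minimal illustration of that decision rule (the probability array below is made up; the only real part is the $\arg\max$ over the $K$ class probabilities):

    import numpy as np

    # hypothetical class probabilities p_j(I) from classifier C for 5 samples and K = 3 classes
    p = np.array([[0.2, 0.5, 0.3],
                  [0.7, 0.1, 0.2],
                  [0.1, 0.1, 0.8],
                  [0.4, 0.4, 0.2],
                  [0.3, 0.3, 0.4]])

    # class(I) := argmax_j p_j(I), i.e. pick the most probable class for each sample
    predicted_class = np.argmax(p, axis=1)
    print(predicted_class)  # [1 0 2 0 2]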
I have some time-series data, which I need to use to predict a continuous value for a given time-stamp. I was initially doing it using a Multivariate Regression Model, but I later figured that a time-series-based problem could be better solved using Hidden Markov Models. The dataset consists of a time-stamp label, around 30 features collected from IoT sensors, and one target, which is a continuous variable. The problem is how do I determine the …
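If you do try the HMM route, a common starting point is a Gaussian HMM fitted on the sensor columns. This is only a sketch under assumptions not in the question: it uses the hmmlearn package, random placeholder data in place of your 30 features, and a guessed number of hidden states.

    import numpy as np
    from hmmlearn.hmm import GaussianHMM

    # X: (n_timestamps, n_features) array built from the ~30 sensor columns (placeholder data here)
    X = np.random.rand(500, 30)

    # the number of hidden states is a modeling choice; compare log-likelihood/BIC across values
    model = GaussianHMM(n_components=4, covariance_type="diag", n_iter=100)
    model.fit(X)

    hidden_states = model.predict(X)  # most likely hidden-state sequence (Viterbi)
    print(model.score(X))             # log-likelihood of the observations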
Ok, what is wrong with my code? I am trying to calculate transition probabilities for each leg. The code works for a small array, but for the actual dataset I get a memory error. I have the 64-bit version of Python and have maximized the memory usage, so I believe I need help writing the code more efficiently.

    import numpy as np
    # sequence with 3 states -> 0, 1, 2
    arr = [0, 1, 0, 0, 0, 2, 2, 1, 1, 1, 0, 0, 0, 0, …
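Since the snippet is cut off, here is only a hedged sketch of a memory-light way to get the same transition probabilities: count consecutive pairs with np.add.at into a small K x K matrix instead of building any large intermediate arrays (the arr values are the ones shown above; n_states is assumed to be 3).

    import numpy as np

    arr = np.array([0, 1, 0, 0, 0, 2, 2, 1, 1, 1, 0, 0, 0, 0], dtype=np.int64)
    n_states = 3

    # count transitions arr[i] -> arr[i+1] into a K x K matrix, no big temporaries
    counts = np.zeros((n_states, n_states), dtype=np.int64)
    np.add.at(counts, (arr[:-1], arr[1:]), 1)

    # row-normalize to get transition probabilities (rows with no outgoing transitions stay 0)
    row_sums = counts.sum(axis=1, keepdims=True)
    probs = np.divide(counts, row_sums, out=np.zeros_like(counts, dtype=float), where=row_sums > 0)
    print(probs)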
What is the difference between Reinforcement Learning (RL) and Supervised Learning? Does RL have more difficulty in finding a stable solution? Does Q-learning have more difficulty in finding a stable solution? Does getting stuck in a local minimum happen more in supervised learning? Is this figure correct in saying that Supervised Learning is part of RL?
I have data on each page visited by a customer in a session; my objective is to find the optimal path, i.e. the one with the maximum conversion rate. My idea is to use a Markov chain to identify it, and probably a mixture of Markov models to avoid bias towards any particular set of customers. Please let me know if I am heading in the wrong direction.
I have sequential data from time T1 to T6. The rows contain the sequence of states for 50 customers. There are only 3 states in my data. For example, it looks like this:

            T1  T2  T3  T4  T5  T6
    Cust1   C   B   C   A   A   C

My transition matrix X looks like this:

         A    B    C
    A    0.3  0.6  0.1
    B    0.5  0.2  0.3
    C    0.7  0.1  0.2

Now, we see that at time T6 the state is at …
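Since the question is cut off, here is just a sketch of one way to use X for the next step: treat the row of the current state as the distribution over the state at T7 (the matrix values are the ones from the question; the assumption is that rows are the current state and columns the next state, in the order A, B, C).

    import numpy as np

    # transition matrix X from the question (rows = current state, columns = next state; order A, B, C)
    X = np.array([[0.3, 0.6, 0.1],
                  [0.5, 0.2, 0.3],
                  [0.7, 0.1, 0.2]])

    # Cust1 is in state C at T6, i.e. all probability mass is currently on C
    current = np.array([0.0, 0.0, 1.0])

    # one-step-ahead distribution over A, B, C at T7
    next_dist = current @ X
    print(next_dist)                     # [0.7 0.1 0.2]
    print("ABC"[np.argmax(next_dist)])   # most likely next state: A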
Right now I am trying to better understand how Bayesian modeling works, starting with just the basics. I found through reading tutorials that some very basic Bayesian models, like Bayesian Hierarchical Modeling, use something called the "Gibbs sampling algorithm", which is a Markov Chain Monte Carlo method. I know that, if I am going to do anything with Markov chains, then I have to test whether the data or a parameter violates the memorylessness assumption. However, I am uncertain what exactly …
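As a toy illustration of what Gibbs sampling does (not taken from any particular tutorial; the target here is a standard bivariate normal and the correlation rho is a made-up value), each step draws one coordinate from its conditional distribution given the other:

    import numpy as np

    rho = 0.8           # correlation of the target bivariate normal (illustrative value)
    n_samples = 5000
    rng = np.random.default_rng(0)

    x, y = 0.0, 0.0
    samples = np.empty((n_samples, 2))
    for i in range(n_samples):
        # conditional of each coordinate given the other is N(rho * other, 1 - rho^2)
        x = rng.normal(rho * y, np.sqrt(1 - rho**2))
        y = rng.normal(rho * x, np.sqrt(1 - rho**2))
        samples[i] = x, y

    # after discarding burn-in, the empirical correlation should be close to rho
    print(np.corrcoef(samples[1000:].T)[0, 1])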
The instructions for the question: State A is absorbing. A transition to A from state 1 or 4 yields an immediate reward of 12. All other transitions incur a reward of 1. Transitions are deterministic (i.e. each action maps a state s to a unique successor state s'). For the remainder of this question, we will assume $\gamma = 1$. On this MDP, consider a policy that assigns transition probabilities as indicated in the figure below. E.g.: (move to A | currently in …
I would like to find some good courses, but also a quick response on how to model a transition matrix given the states. Imagine having 4 states and the following array: [1, 2, 4, 1, 3, 4, 2, etc.]. What calculations are possible with only an array of states? You can make the array as long as you want; I just gave a random example. Python, Excel, and blog solutions are welcome.
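As a quick response for the Python part, a minimal sketch (the array is just the example from the question, and the states are assumed to be labeled 1..4): count how often each state is followed by each other state, then normalize the rows.

    import numpy as np

    seq = np.array([1, 2, 4, 1, 3, 4, 2])   # example state sequence, states labeled 1..4
    n_states = 4

    # count how often state i is immediately followed by state j
    counts = np.zeros((n_states, n_states))
    for a, b in zip(seq[:-1], seq[1:]):
        counts[a - 1, b - 1] += 1

    # turn counts into transition probabilities by normalizing each row
    row_sums = counts.sum(axis=1, keepdims=True)
    transition_matrix = np.divide(counts, row_sums, out=np.zeros_like(counts), where=row_sums > 0)
    print(transition_matrix)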
Let's start with the following hypothetical preconditions:

- There is traffic: normal and anomalous.
- Each traffic sample contains a list of events (of variable size).
- Events happen in order; the set of possible events has ~40,000 elements.
- The solution should run on relatively small amounts of memory and processing power.

Given a traffic sample (of 1000 events max), what is the best machine learning algorithm that fits these preconditions to identify whether it's an anomaly? Given my limited knowledge in machine learning …
So what I'm looking for is the best approach to predict a future state. Say we have three states: A, B, C. I want to predict whether in the next time interval (e.g. a day or a week) the state will become C. My (historical) data looks like this:

    ID  Date        State
    1   2021-12-01  A
    1   2021-12-02  B
    1   2021-12-06  A
    1   2021-12-24  C
    2   2021-12-05  A
    2   2021-12-12  B
    2   2021-12-27  C

For a new ID, the history could look …
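One simple baseline (only a sketch, and it ignores the irregular gaps between dates) is a first-order Markov model: pair each state with the next state observed for the same ID, estimate transition probabilities, then read off the probability of moving to C from the new ID's current state. The column names match the example above.

    import pandas as pd

    # the example rows from the question
    df = pd.DataFrame({
        "ID":    [1, 1, 1, 1, 2, 2, 2],
        "Date":  pd.to_datetime(["2021-12-01", "2021-12-02", "2021-12-06", "2021-12-24",
                                 "2021-12-05", "2021-12-12", "2021-12-27"]),
        "State": ["A", "B", "A", "C", "A", "B", "C"],
    })

    # pair each state with the next state observed for the same ID, in date order
    df = df.sort_values(["ID", "Date"])
    df["NextState"] = df.groupby("ID")["State"].shift(-1)
    pairs = df.dropna(subset=["NextState"])

    # empirical transition probabilities, e.g. P(next = C | current = B)
    transitions = pd.crosstab(pairs["State"], pairs["NextState"], normalize="index")
    print(transitions)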
This is the value function expression for a stochastic policy: $\displaystyle v_{\pi}(s)=\sum_{a \in \mathcal{A}}\pi(a|s)\bigg(\mathcal{R}_s^a+\gamma \sum_{s' \in \mathcal{S}} \mathbb{P}_{ss'}^a v_{\pi}(s')\bigg) $ Question: What is the form of the value function when the policy is deterministic?
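One way to see the deterministic case: if the policy is written as a function $a = \pi(s)$, then $\pi(a|s)$ equals 1 for that single action and 0 for all others, so the outer sum collapses to a single term:

$$v_{\pi}(s) = \mathcal{R}_s^{\pi(s)} + \gamma \sum_{s' \in \mathcal{S}} \mathbb{P}_{ss'}^{\pi(s)} \, v_{\pi}(s').$$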
In Sutton & Barto's book Reinforcement Learning: An Introduction, there is the following problem: I have this question: why are the policies to be considered here deterministic?
I have seen several examples of deploying RL agents in deceptive environments or games, and the agent learns to perform its task regardless. What about the other way around? Can RL be used to create deceptive agents? An example could be asking an agent a question, "What color is this?", and it replies with a lie. I am interested in a higher level of "deception" and not a simple if-else program that doesn't tell you what you need …
For the above Markov decision process, under the given action policy $a_1$, how can I determine the value of state $s_1$ using the state-value definition $v(s)=\mathbb{E}[G_t \mid S_t=s]$, where $G_t$ is the return? Assume there is no discounting (i.e., $\gamma=1$).
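Without the figure, only the definition itself can be expanded; with $\gamma = 1$ the return is the undiscounted sum of rewards collected along the trajectory that the policy induces from $s_1$:

$$v(s_1) = \mathbb{E}\left[G_t \mid S_t = s_1\right] = \mathbb{E}\left[R_{t+1} + R_{t+2} + R_{t+3} + \dots \mid S_t = s_1\right].$$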
I am studying reinforcement learning and I am working methodically through Sutton and Barto's book plus David Silver's lectures. I have noticed a minor difference in how the Markov Decision Processes (MDPs) are defined in those two sources, that affects the formulation of the Bellman equations, and I wonder about the reasoning behind the differences and when I might choose one or the other. In Sutton and Barto, the expected reward function is written $R^a_{ss'}$, whilst in David Silver's lectures …
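Assuming the truncated contrast is with Silver's $\mathcal{R}_s^a$ notation, where the expected reward depends only on the state and action, the two conventions are related by averaging the Sutton & Barto reward over the successor state:

$$\mathcal{R}_s^a = \sum_{s' \in \mathcal{S}} \mathbb{P}_{ss'}^a \, R_{ss'}^a.$$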
I've essentially been handed a dataset of website access history and I'm trying to draw some conclusions from it. The data supplied gives me the web URL, the datetime when it was accessed, and the unique ID of the user accessing that data. This means that for a given user ID, I can see a timeline of how they went through the website and what pages they looked at. I'd quite like to try clustering these users into different …
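If the clustering idea is worth a try, one hedged starting point is to turn each user's page sequence into a fixed-length vector of page-to-page transition frequencies and cluster those vectors; everything below (column names, URLs, the choice of KMeans and 2 clusters) is a made-up illustration, not a description of your dataset.

    import pandas as pd
    from sklearn.cluster import KMeans

    # placeholder data with the three fields described: user ID, access datetime, URL
    df = pd.DataFrame({
        "user_id":  [1, 1, 1, 2, 2, 2],
        "datetime": pd.to_datetime(["2023-01-01 10:00", "2023-01-01 10:01", "2023-01-01 10:05",
                                    "2023-01-02 09:00", "2023-01-02 09:02", "2023-01-02 09:04"]),
        "url":      ["/home", "/products", "/cart", "/home", "/blog", "/home"],
    })

    # build "current page -> next page" pairs per user, in time order
    df = df.sort_values(["user_id", "datetime"])
    df["next_url"] = df.groupby("user_id")["url"].shift(-1)
    pairs = df.dropna(subset=["next_url"]).copy()
    pairs["edge"] = pairs["url"] + " -> " + pairs["next_url"]

    # one row per user, one column per observed transition, values = relative frequencies
    features = pd.crosstab(pairs["user_id"], pairs["edge"], normalize="index")

    # cluster users on their transition profiles (2 clusters purely for illustration)
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(features)
    print(dict(zip(features.index, labels)))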