Which algorithm is best to detect anomalies within a data set of 5k+ user-login events?

ML_newbie asked Oct 5, 2021

443 views

I am trying to build an unsupervised ML model to detect anomalies within 5000+ users' login data. I selected 5 features from each user-login event (IP, hour of day, day of week, device_id, OS), and I am looking for the best algorithm to use. I am considering using a density function to estimate the probability of each feature value and decide whether an event is an outlier. The problem is that feature values are only meaningful relative to a specific user. For example, you cannot compare login IPs across users; a login IP only means something for the user it belongs to.
Ultimately, I want to detect events that represent a change in a user's login behavior, such as a different IP, day, hour, device_id, or OS, where the more features have changed, the higher the probability that the event is an outlier.
At this point, I am not sure how to build a model from data that contains multiple users, because I don't know how to separate the data so that the model is trained per user and finds anomalies within each individual user's features.
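
The per-user idea described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not a recommended algorithm: it assumes the events sit in a pandas DataFrame with hypothetical columns `user_id`, `ip`, `hour`, `day_of_week`, `device_id`, and `os`, treats every feature as categorical, and uses a smoothed per-user frequency as a stand-in for the density estimate. The function name `score_logins` and the smoothing parameter `alpha` are made up for this example.

```python
import numpy as np
import pandas as pd

# Hypothetical column names; adjust to the real schema.
FEATURES = ["ip", "hour", "day_of_week", "device_id", "os"]


def score_logins(df, user_col="user_id", features=FEATURES, alpha=1.0):
    """Return one score per row; higher means more unusual for that user.

    Assumes no missing values in the user or feature columns.
    """
    scores = pd.Series(0.0, index=df.index)
    # Number of events recorded for the user that owns each row.
    n_events = df.groupby(user_col)[user_col].transform("size")
    for col in features:
        # How often this user produced this exact value for this feature.
        count = df.groupby([user_col, col])[col].transform("size")
        # How many distinct values this user has shown for this feature.
        n_distinct = df.groupby(user_col)[col].transform("nunique")
        # Smoothed leave-one-out frequency: the event being scored is
        # excluded from its own count, and alpha keeps the value above zero.
        prob = (count - 1 + alpha) / (n_events - 1 + alpha * (n_distinct + 1))
        # Summing -log(prob) makes each additional changed feature add to the score.
        scores += -np.log(prob)
    return scores
```

Because every count is computed inside a `groupby` on the user column, an IP that is routine for one user contributes nothing to another user's statistics. Calling `score_logins(df)` and sorting descending would surface the events with the most, and rarest, feature changes per user.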

I also don't have any labeled data to use for testing. Should I fabricate some?
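
If fabricating test data turns out to be acceptable, one possible approach is to copy real events and overwrite a few features with values borrowed from a different user, then check whether the model ranks those copies highly. The sketch below assumes a pandas DataFrame with a unique index and the same hypothetical columns (`user_id`, `ip`, `hour`, `day_of_week`, `device_id`, `os`); the function name `inject_anomalies` and the label column `is_synthetic` are made up for this example.

```python
import numpy as np
import pandas as pd

# Hypothetical column names; adjust to the real schema.
FEATURES = ["ip", "hour", "day_of_week", "device_id", "os"]


def inject_anomalies(df, n_anomalies, n_changed=3, user_col="user_id",
                     features=FEATURES, seed=0):
    """Append fabricated logins and label them with is_synthetic = 1."""
    rng = np.random.default_rng(seed)
    # Start from copies of real events so untouched features stay realistic.
    fake = df.sample(n=n_anomalies, random_state=seed).copy()
    for idx in fake.index:
        # Pick a donor event that belongs to a different user.
        others = df[df[user_col] != fake.at[idx, user_col]]
        donor = others.iloc[rng.integers(len(others))]
        # Overwrite a random subset of features with the donor's values.
        # A borrowed value can coincide with the user's own (e.g. same OS),
        # so some fabricated events are milder than n_changed suggests.
        for col in rng.choice(features, size=n_changed, replace=False):
            fake.at[idx, col] = donor[col]
    return pd.concat(
        [df.assign(is_synthetic=0), fake.assign(is_synthetic=1)],
        ignore_index=True,
    )
```

Scores on fabricated events only indicate whether the model reacts to this particular kind of change; they say little about anomalies that look different from the injected ones.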

Any advice greatly appreciated.

Thank you!

ML_newbie

120 points

Related questions

3 votes

1 answer

7,790 views

tofighi asked Feb 3, 2020

How to calculate the probability and accuracy of a Logistic Regression classifier?

How to solve this problem? https://i.imgur.com/8urywpf.jpg Q1) Complete the ? sections Q2) Accuracy of system if threshold = 0.5? Q3) Accuracy of system if threshold = 0.95?

tofighi

116k points

0 votes

1 answer

893 views

rakesh asked Feb 10, 2020

How to map (string compare) a string with 10000+ strings in DB? Which is the best way to do it?

I have a result string like this A, C, D, F, B, In my DB I have 10,000+ combinations like this 1. A, B, C, D 2. A, C, B, I, D, W, Z etc.... Now I want to search for my ...

rakesh

120 points

4 votes

1 answer

1,002 views

Ahmed Merah asked Oct 12, 2018

If x is the number that comes up when you roll a 20-sided die, what is the expected value of x?

Ahmed Merah

270 points

1 vote

0 answers

618 views

Ahmed Merah asked Oct 12, 2018

What is the probability of getting a King of Hearts?

What is the probability of picking a King of Hearts in a standard 52 card deck?

Ahmed Merah

270 points

2 votes

1 answer

1,979 views

KenTheStatsMan asked Oct 1, 2018

A standard deck of 52 cards is shuffled and a single card is drawn. What is the probability that this card is a Joker?

This is question 1d in the 2B exercises. The answer says the probability is 0 but I count 4 jokers in a 52 card deck, which means the probability is 1/13. Think I misunde...

KenTheStatsMan

430 points

0 Answers
