Interpretation of ROC AUC scoreWhat is the right algorithm to detect segmentations of a line chart?Variance in cross validation score / model selectionOutlier detection by unsupervised algorithm: Fraud DetectionROC curve for different hyperparameters of `RandomForestClassifier`?different results with MEKA vs Scikit-learn!Interpretation of variable or feature importance in Random ForestGradient descent multidimensional linear regression - does learning rate affects concurrency?Decent ROC, but horrible Precision-Recall curveEvaluating the test setLooking for a classification (?) algorithm for linearly separable but unlabeled data points

How to tell your grandparent to not come to fetch you with their car?

Do simulator games use a realistic trajectory to get into orbit?

Confusion around using "des" in sentences

Generate a Graeco-Latin square

Where Mongol herds graze

How to return a security deposit to a tenant

Watts of filament extrusion

1980s live-action movie where individually-coloured nations on clouds fight

Is counterpoint still used today?

Does an ice chest packed full of frozen food need ice?

What is the reason for double NULL check of pointer for mutex lock

Why is one of Madera Municipal's runways labelled with only "R" on both sides?

Is it a problem if <h4>, <h5> and <h6> are smaller than regular text?

Second (easy access) account in case my bank screws up

Why would future John risk sending back a T-800 to save his younger self?

What language is software running on the ISS written in?

Should I avoid hard-packed crusher dust trails with my hybrid?

Is mleccha entirely different from Hindu classification?

At what point in time did Dumbledore ask Snape for this favor?

How is water heavier than petrol, even though its molecular weight is less than petrol?

What ways have you found to get edits from non-LaTeX users?

Are there any instruments that don't produce overtones?

Source that a married woman seduced by a “messianic figure” is still permitted to her husband

What makes Ada the language of choice for the ISS's safety-critical systems?



Interpretation of ROC AUC score


What is the right algorithm to detect segmentations of a line chart?Variance in cross validation score / model selectionOutlier detection by unsupervised algorithm: Fraud DetectionROC curve for different hyperparameters of `RandomForestClassifier`?different results with MEKA vs Scikit-learn!Interpretation of variable or feature importance in Random ForestGradient descent multidimensional linear regression - does learning rate affects concurrency?Decent ROC, but horrible Precision-Recall curveEvaluating the test setLooking for a classification (?) algorithm for linearly separable but unlabeled data points













2












$begingroup$


i tried to evaluate 6 models and after plotting , this what i get :
enter image description here



So i'm wondering , if those results are "Right" ?



Thank's in advance.










share|improve this question











$endgroup$











  • $begingroup$
    Can you give some more information on your project? As Juan said, these AUC numbers are suspiciously good, and this chart alone is not very informative.
    $endgroup$
    – Upper_Case
    May 21 at 15:36










  • $begingroup$
    I have to develop a model to predict whether a person will get hired as DS or not , and here is just some model evaluation , to see what's the best model to use for my task .
    $endgroup$
    – Dimi
    May 21 at 15:43















2












$begingroup$


i tried to evaluate 6 models and after plotting , this what i get :
enter image description here



So i'm wondering , if those results are "Right" ?



Thank's in advance.










share|improve this question











$endgroup$











  • $begingroup$
    Can you give some more information on your project? As Juan said, these AUC numbers are suspiciously good, and this chart alone is not very informative.
    $endgroup$
    – Upper_Case
    May 21 at 15:36










  • $begingroup$
    I have to develop a model to predict whether a person will get hired as DS or not , and here is just some model evaluation , to see what's the best model to use for my task .
    $endgroup$
    – Dimi
    May 21 at 15:43













2












2








2


1



$begingroup$


i tried to evaluate 6 models and after plotting , this what i get :
enter image description here



So i'm wondering , if those results are "Right" ?



Thank's in advance.










share|improve this question











$endgroup$




i tried to evaluate 6 models and after plotting , this what i get :
enter image description here



So i'm wondering , if those results are "Right" ?



Thank's in advance.







random-forest svm logistic-regression model-selection






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited May 22 at 0:57









Esmailian

4,654422




4,654422










asked May 21 at 14:01









DimiDimi

332




332











  • $begingroup$
    Can you give some more information on your project? As Juan said, these AUC numbers are suspiciously good, and this chart alone is not very informative.
    $endgroup$
    – Upper_Case
    May 21 at 15:36










  • $begingroup$
    I have to develop a model to predict whether a person will get hired as DS or not , and here is just some model evaluation , to see what's the best model to use for my task .
    $endgroup$
    – Dimi
    May 21 at 15:43
















  • $begingroup$
    Can you give some more information on your project? As Juan said, these AUC numbers are suspiciously good, and this chart alone is not very informative.
    $endgroup$
    – Upper_Case
    May 21 at 15:36










  • $begingroup$
    I have to develop a model to predict whether a person will get hired as DS or not , and here is just some model evaluation , to see what's the best model to use for my task .
    $endgroup$
    – Dimi
    May 21 at 15:43















$begingroup$
Can you give some more information on your project? As Juan said, these AUC numbers are suspiciously good, and this chart alone is not very informative.
$endgroup$
– Upper_Case
May 21 at 15:36




$begingroup$
Can you give some more information on your project? As Juan said, these AUC numbers are suspiciously good, and this chart alone is not very informative.
$endgroup$
– Upper_Case
May 21 at 15:36












$begingroup$
I have to develop a model to predict whether a person will get hired as DS or not , and here is just some model evaluation , to see what's the best model to use for my task .
$endgroup$
– Dimi
May 21 at 15:43




$begingroup$
I have to develop a model to predict whether a person will get hired as DS or not , and here is just some model evaluation , to see what's the best model to use for my task .
$endgroup$
– Dimi
May 21 at 15:43










2 Answers
2






active

oldest

votes


















5












$begingroup$

Did you evaluate the results in the training set? Or in the test set?



Those results are outstandingly good! Suspiciously good.



I think you tried your results in the training set only, so your results reflect overfitting on your data, which means your model learned the set, it was not generalized (which means it is unapplicable to any other dataset you may encounter in the future, which is not useful).



For comparing ROC between methodologies you should model them being careful for overfitting and try them on a test dataset (a dataset which you never knew before, which can be obtained partitioning your dataset).



In that way your comparison is not measuring which model learns by memory your data.






share|improve this answer









$endgroup$












  • $begingroup$
    Well , I did split my data into Training and testing and this evaluation has been done only on the Training.
    $endgroup$
    – Dimi
    May 21 at 15:37










  • $begingroup$
    Yes! That is what happens! You should assess your model in the dataset your model doesn't know (test dataset), so you can measure the generallity of it.
    $endgroup$
    – Juan Esteban de la Calle
    May 21 at 16:01










  • $begingroup$
    Alright , thank's a lot
    $endgroup$
    – Dimi
    May 21 at 16:06


















2












$begingroup$

I'm not sure that AUC is the right value to use to compare these models. Have a look at this question for a bit more detail.



In any case, the AUC of your training data is not a very informative piece of information, and assessing the performance of your model on the training set isn't enough to determine how "right" your model(s) may be, no matter how you go about it. Such a comparison could be done on a test set, at the earliest, and better still, totally out-of-sample data (as final model training usually includes re-training on the full data set).



Finally, model performance (in application) will be defined by choosing a specific cut point for your predictors (and the corresponding true positive/false positive tradeoff) rather than by looking at the overall ability of your model to discern between outcomes across many possible cut points. Overall AUC may still be interesting, but applied model performance is a more precise question (this question has some good information on that).






share|improve this answer











$endgroup$












  • $begingroup$
    Intresting , Thank you
    $endgroup$
    – Dimi
    May 21 at 15:58











Your Answer








StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f52325%2finterpretation-of-roc-auc-score%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























2 Answers
2






active

oldest

votes








2 Answers
2






active

oldest

votes









active

oldest

votes






active

oldest

votes









5












$begingroup$

Did you evaluate the results in the training set? Or in the test set?



Those results are outstandingly good! Suspiciously good.



I think you tried your results in the training set only, so your results reflect overfitting on your data, which means your model learned the set, it was not generalized (which means it is unapplicable to any other dataset you may encounter in the future, which is not useful).



For comparing ROC between methodologies you should model them being careful for overfitting and try them on a test dataset (a dataset which you never knew before, which can be obtained partitioning your dataset).



In that way your comparison is not measuring which model learns by memory your data.






share|improve this answer









$endgroup$












  • $begingroup$
    Well , I did split my data into Training and testing and this evaluation has been done only on the Training.
    $endgroup$
    – Dimi
    May 21 at 15:37










  • $begingroup$
    Yes! That is what happens! You should assess your model in the dataset your model doesn't know (test dataset), so you can measure the generallity of it.
    $endgroup$
    – Juan Esteban de la Calle
    May 21 at 16:01










  • $begingroup$
    Alright , thank's a lot
    $endgroup$
    – Dimi
    May 21 at 16:06















5












$begingroup$

Did you evaluate the results in the training set? Or in the test set?



Those results are outstandingly good! Suspiciously good.



I think you tried your results in the training set only, so your results reflect overfitting on your data, which means your model learned the set, it was not generalized (which means it is unapplicable to any other dataset you may encounter in the future, which is not useful).



For comparing ROC between methodologies you should model them being careful for overfitting and try them on a test dataset (a dataset which you never knew before, which can be obtained partitioning your dataset).



In that way your comparison is not measuring which model learns by memory your data.






share|improve this answer









$endgroup$












  • $begingroup$
    Well , I did split my data into Training and testing and this evaluation has been done only on the Training.
    $endgroup$
    – Dimi
    May 21 at 15:37










  • $begingroup$
    Yes! That is what happens! You should assess your model in the dataset your model doesn't know (test dataset), so you can measure the generallity of it.
    $endgroup$
    – Juan Esteban de la Calle
    May 21 at 16:01










  • $begingroup$
    Alright , thank's a lot
    $endgroup$
    – Dimi
    May 21 at 16:06













5












5








5





$begingroup$

Did you evaluate the results in the training set? Or in the test set?



Those results are outstandingly good! Suspiciously good.



I think you tried your results in the training set only, so your results reflect overfitting on your data, which means your model learned the set, it was not generalized (which means it is unapplicable to any other dataset you may encounter in the future, which is not useful).



For comparing ROC between methodologies you should model them being careful for overfitting and try them on a test dataset (a dataset which you never knew before, which can be obtained partitioning your dataset).



In that way your comparison is not measuring which model learns by memory your data.






share|improve this answer









$endgroup$



Did you evaluate the results in the training set? Or in the test set?



Those results are outstandingly good! Suspiciously good.



I think you tried your results in the training set only, so your results reflect overfitting on your data, which means your model learned the set, it was not generalized (which means it is unapplicable to any other dataset you may encounter in the future, which is not useful).



For comparing ROC between methodologies you should model them being careful for overfitting and try them on a test dataset (a dataset which you never knew before, which can be obtained partitioning your dataset).



In that way your comparison is not measuring which model learns by memory your data.







share|improve this answer












share|improve this answer



share|improve this answer










answered May 21 at 14:58









Juan Esteban de la CalleJuan Esteban de la Calle

1,381324




1,381324











  • $begingroup$
    Well , I did split my data into Training and testing and this evaluation has been done only on the Training.
    $endgroup$
    – Dimi
    May 21 at 15:37










  • $begingroup$
    Yes! That is what happens! You should assess your model in the dataset your model doesn't know (test dataset), so you can measure the generallity of it.
    $endgroup$
    – Juan Esteban de la Calle
    May 21 at 16:01










  • $begingroup$
    Alright , thank's a lot
    $endgroup$
    – Dimi
    May 21 at 16:06
















  • $begingroup$
    Well , I did split my data into Training and testing and this evaluation has been done only on the Training.
    $endgroup$
    – Dimi
    May 21 at 15:37










  • $begingroup$
    Yes! That is what happens! You should assess your model in the dataset your model doesn't know (test dataset), so you can measure the generallity of it.
    $endgroup$
    – Juan Esteban de la Calle
    May 21 at 16:01










  • $begingroup$
    Alright , thank's a lot
    $endgroup$
    – Dimi
    May 21 at 16:06















$begingroup$
Well , I did split my data into Training and testing and this evaluation has been done only on the Training.
$endgroup$
– Dimi
May 21 at 15:37




$begingroup$
Well , I did split my data into Training and testing and this evaluation has been done only on the Training.
$endgroup$
– Dimi
May 21 at 15:37












$begingroup$
Yes! That is what happens! You should assess your model in the dataset your model doesn't know (test dataset), so you can measure the generallity of it.
$endgroup$
– Juan Esteban de la Calle
May 21 at 16:01




$begingroup$
Yes! That is what happens! You should assess your model in the dataset your model doesn't know (test dataset), so you can measure the generallity of it.
$endgroup$
– Juan Esteban de la Calle
May 21 at 16:01












$begingroup$
Alright , thank's a lot
$endgroup$
– Dimi
May 21 at 16:06




$begingroup$
Alright , thank's a lot
$endgroup$
– Dimi
May 21 at 16:06











2












$begingroup$

I'm not sure that AUC is the right value to use to compare these models. Have a look at this question for a bit more detail.



In any case, the AUC of your training data is not a very informative piece of information, and assessing the performance of your model on the training set isn't enough to determine how "right" your model(s) may be, no matter how you go about it. Such a comparison could be done on a test set, at the earliest, and better still, totally out-of-sample data (as final model training usually includes re-training on the full data set).



Finally, model performance (in application) will be defined by choosing a specific cut point for your predictors (and the corresponding true positive/false positive tradeoff) rather than by looking at the overall ability of your model to discern between outcomes across many possible cut points. Overall AUC may still be interesting, but applied model performance is a more precise question (this question has some good information on that).






share|improve this answer











$endgroup$












  • $begingroup$
    Intresting , Thank you
    $endgroup$
    – Dimi
    May 21 at 15:58















2












$begingroup$

I'm not sure that AUC is the right value to use to compare these models. Have a look at this question for a bit more detail.



In any case, the AUC of your training data is not a very informative piece of information, and assessing the performance of your model on the training set isn't enough to determine how "right" your model(s) may be, no matter how you go about it. Such a comparison could be done on a test set, at the earliest, and better still, totally out-of-sample data (as final model training usually includes re-training on the full data set).



Finally, model performance (in application) will be defined by choosing a specific cut point for your predictors (and the corresponding true positive/false positive tradeoff) rather than by looking at the overall ability of your model to discern between outcomes across many possible cut points. Overall AUC may still be interesting, but applied model performance is a more precise question (this question has some good information on that).






share|improve this answer











$endgroup$












  • $begingroup$
    Intresting , Thank you
    $endgroup$
    – Dimi
    May 21 at 15:58













2












2








2





$begingroup$

I'm not sure that AUC is the right value to use to compare these models. Have a look at this question for a bit more detail.



In any case, the AUC of your training data is not a very informative piece of information, and assessing the performance of your model on the training set isn't enough to determine how "right" your model(s) may be, no matter how you go about it. Such a comparison could be done on a test set, at the earliest, and better still, totally out-of-sample data (as final model training usually includes re-training on the full data set).



Finally, model performance (in application) will be defined by choosing a specific cut point for your predictors (and the corresponding true positive/false positive tradeoff) rather than by looking at the overall ability of your model to discern between outcomes across many possible cut points. Overall AUC may still be interesting, but applied model performance is a more precise question (this question has some good information on that).






share|improve this answer











$endgroup$



I'm not sure that AUC is the right value to use to compare these models. Have a look at this question for a bit more detail.



In any case, the AUC of your training data is not a very informative piece of information, and assessing the performance of your model on the training set isn't enough to determine how "right" your model(s) may be, no matter how you go about it. Such a comparison could be done on a test set, at the earliest, and better still, totally out-of-sample data (as final model training usually includes re-training on the full data set).



Finally, model performance (in application) will be defined by choosing a specific cut point for your predictors (and the corresponding true positive/false positive tradeoff) rather than by looking at the overall ability of your model to discern between outcomes across many possible cut points. Overall AUC may still be interesting, but applied model performance is a more precise question (this question has some good information on that).







share|improve this answer














share|improve this answer



share|improve this answer








edited May 21 at 16:00

























answered May 21 at 15:57









Upper_CaseUpper_Case

28615




28615











  • $begingroup$
    Intresting , Thank you
    $endgroup$
    – Dimi
    May 21 at 15:58
















  • $begingroup$
    Intresting , Thank you
    $endgroup$
    – Dimi
    May 21 at 15:58















$begingroup$
Intresting , Thank you
$endgroup$
– Dimi
May 21 at 15:58




$begingroup$
Intresting , Thank you
$endgroup$
– Dimi
May 21 at 15:58

















draft saved

draft discarded
















































Thanks for contributing an answer to Data Science Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f52325%2finterpretation-of-roc-auc-score%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

How to write a 12-bar blues melodyI-IV-V blues progressionHow to play the bridges in a standard blues progressionHow does Gdim7 fit in C# minor?question on a certain chord progressionMusicology of Melody12 bar blues, spread rhythm: alternative to 6th chord to avoid finger stretchChord progressions/ Root key/ MelodiesHow to put chords (POP-EDM) under a given lead vocal melody (starting from a good knowledge in music theory)Are there “rules” for improvising with the minor pentatonic scale over 12-bar shuffle?Confusion about blues scale and chords

What if the end-user didn't have the required library?What is setup.py?What is a clean, pythonic way to have multiple constructors in Python?What does Ruby have that Python doesn't, and vice versa?What is the reason for having '//' in Python?How do I create a namespace package in Python?How to package shared objects that python modules depend on?setuptools vs. distutils: why is distutils still a thing?Navigation in Windows 10 vs code not going to virtualenv library when the same library is installed at user levelPython create package for local usePackaging a project that uses multiple python versionsWhy is permission denied on pip install except for when “--user” is included at end of command?

Esgonzo ibérico Índice Descrición Distribución Hábitat Ameazas Notas Véxase tamén "Acerca dos nomes dos anfibios e réptiles galegos""Chalcides bedriagai"Chalcides bedriagai en Carrascal, L. M. Salvador, A. (Eds). Enciclopedia virtual de los vertebrados españoles. Museo Nacional de Ciencias Naturales, Madrid. España.Fotos