Inverse Relationship Between Precision and Recall The 2019 Stack Overflow Developer Survey Results Are In Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern) 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsRelationship between KS, AUROC, and GiniTerm for relative recallIs this a good classified model based confusion matrix and classification report?How do I pick “K” for precision at K and recall at K?How to calculate Precision and Recall using confusion matrix in Matlab?Can tuning individual precision and recall classification thresholds improve deep learning models?Matrix Confusion - Get Model PrecisionPrecision and Recall if not binaryWhy validation loss worsens while precision/recall continue to improve?Hands on Machine Learning with Scikit Learn and TensorFlow Confusion Matrix with VERY BAD score

Why did all the guest students take carriages to the Yule Ball?

Still taught to reverse oxidation half cells in electrochemistry?

Typeface like Times New Roman but with "tied" percent sign

What are these Gizmos at Izaña Atmospheric Research Center in Spain?

Take groceries in checked luggage

Did the UK government pay "millions and millions of dollars" to try to snag Julian Assange?

Can a novice safely splice in wire to lengthen 5V charging cable?

Is a pteranodon too powerful as a beast companion for a beast master?

How can I protect witches in combat who wear limited clothing?

Difference between "generating set" and free product?

Converting from Markdown-with-biblatex-commands to LaTeX

A pet rabbit called Belle

Python - Fishing Simulator

How are presidential pardons supposed to be used?

What's the point in a preamp?

Is above average number of years spent on PhD considered a red flag in future academia or industry positions?

Are spiders unable to hurt humans, especially very small spiders?

What aspect of planet Earth must be changed to prevent the industrial revolution?

Empty set is subset of every set? If yes, why that...

Can a 1st-level character have an ability score above 18?

First use of “packing” as in carrying a gun

Does the AirPods case need to be around while listening via an iOS Device?

Can withdrawing asylum be illegal?

How is simplicity better than precision and clarity in prose?



Inverse Relationship Between Precision and Recall



The 2019 Stack Overflow Developer Survey Results Are In
Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern)
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsRelationship between KS, AUROC, and GiniTerm for relative recallIs this a good classified model based confusion matrix and classification report?How do I pick “K” for precision at K and recall at K?How to calculate Precision and Recall using confusion matrix in Matlab?Can tuning individual precision and recall classification thresholds improve deep learning models?Matrix Confusion - Get Model PrecisionPrecision and Recall if not binaryWhy validation loss worsens while precision/recall continue to improve?Hands on Machine Learning with Scikit Learn and TensorFlow Confusion Matrix with VERY BAD score










6












$begingroup$


I made some search to learn precision and recall and I saw some graphs represents inverse relationship between precision and recall and I started to think about it to clarify subject. I wonder the inverse relationship always hold? Suppose I have a binary classification problem and there are positive and negative labeled classes. After training some of the actual positive examples are predicted as true positives and some of them false negatives and some of the actual negative examples are predicted as true negatives and some of them false positives. To calculate precision and recall I use these formulas:
$$Precision = fracTPTP + FP$$ and $$Recall = fracTPTP + FN$$ If I decrease false negatives then true positives increases and in that case don't precision and recall both increase?










share|improve this question









New contributor




Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







$endgroup$
















    6












    $begingroup$


    I made some search to learn precision and recall and I saw some graphs represents inverse relationship between precision and recall and I started to think about it to clarify subject. I wonder the inverse relationship always hold? Suppose I have a binary classification problem and there are positive and negative labeled classes. After training some of the actual positive examples are predicted as true positives and some of them false negatives and some of the actual negative examples are predicted as true negatives and some of them false positives. To calculate precision and recall I use these formulas:
    $$Precision = fracTPTP + FP$$ and $$Recall = fracTPTP + FN$$ If I decrease false negatives then true positives increases and in that case don't precision and recall both increase?










    share|improve this question









    New contributor




    Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.







    $endgroup$














      6












      6








      6





      $begingroup$


      I made some search to learn precision and recall and I saw some graphs represents inverse relationship between precision and recall and I started to think about it to clarify subject. I wonder the inverse relationship always hold? Suppose I have a binary classification problem and there are positive and negative labeled classes. After training some of the actual positive examples are predicted as true positives and some of them false negatives and some of the actual negative examples are predicted as true negatives and some of them false positives. To calculate precision and recall I use these formulas:
      $$Precision = fracTPTP + FP$$ and $$Recall = fracTPTP + FN$$ If I decrease false negatives then true positives increases and in that case don't precision and recall both increase?










      share|improve this question









      New contributor




      Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.







      $endgroup$




      I made some search to learn precision and recall and I saw some graphs represents inverse relationship between precision and recall and I started to think about it to clarify subject. I wonder the inverse relationship always hold? Suppose I have a binary classification problem and there are positive and negative labeled classes. After training some of the actual positive examples are predicted as true positives and some of them false negatives and some of the actual negative examples are predicted as true negatives and some of them false positives. To calculate precision and recall I use these formulas:
      $$Precision = fracTPTP + FP$$ and $$Recall = fracTPTP + FN$$ If I decrease false negatives then true positives increases and in that case don't precision and recall both increase?







      accuracy confusion-matrix






      share|improve this question









      New contributor




      Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      share|improve this question









      New contributor




      Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      share|improve this question




      share|improve this question








      edited 2 days ago









      Esmailian

      3,181320




      3,181320






      New contributor




      Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      asked 2 days ago









      Tolga KarahanTolga Karahan

      335




      335




      New contributor




      Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.





      New contributor





      Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      Tolga Karahan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.




















          3 Answers
          3






          active

          oldest

          votes


















          2












          $begingroup$

          If we decrease the false negative (select more positives), recall always increases, but precision may increase or decrease. Generally, for models better than random, precision and recall have an inverse relationship (@pythinker's answer), but for models worse than random, they have a direct relationship (@kbrose's example).



          It is worth noting that we can artificially build a sample that causes a model which is better-than-random on true distribution to perform worse-than-random, so we are assuming that the sample resembles the true distribution.



          Recall



          We have
          $$TP = P - FN$$
          therefore, recall would be
          $$r = fracP-FNP = 1- fracFNP$$
          which always increases by decrease in $FN$.



          Precision



          For precision, the relation is not as straightforward. Lets start with two examples.



          First case: decrease in precision, by decrease in false negative:



          label model prediction
          1 0.8
          0 0.2
          0 0.2
          1 0.2


          For threshold $0.5$ (false negative = $(1, 0.2)$),



          $$p = frac11+0=1$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac22+2=0.5$$



          Second case: increase in precision, by decrease in false negative (the same as @kbrose example):



          label model prediction
          0 1.0
          1 0.4
          0 0.1


          For threshold $0.5$ (false negative = $(1, 0.4)$),



          $$p = frac00+1=0$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac11+2=0.33$$



          It is worth noting that ROC curve for this case is





          Analysis of precision based on ROC curve



          When we lower the threshold, false negative decreases, and true positive [rate] increases, which is equivalent to moving to the right in ROC plot. I did a simulation for better-than-random, random, and worse-than-random models, and plotted ROC, recall, and precision:









          As you can see, by moving to the right, for better-than-random model, precision decreases, for random model, precision has substantial fluctuations, and for worse-than-random model precision increases. And there are slight fluctuations in all three cases. Therefore,




          By increase in recall, if model is better than random, precision generally decreases. If mode is worse than random, precision generally increases.




          Here is the code for simulation:



          import numpy as np
          from sklearn.metrics import roc_curve
          from matplotlib import pyplot

          np.random.seed(123)
          count = 2000
          P = int(count * 0.5)
          N = count - P
          # first half zero, second half one
          y_true = np.concatenate((np.zeros((N, 1)), np.ones((P, 1))))

          title = 'Better-than-random model'
          # title = 'Random model'
          # title = 'Worse-than-random model'
          if title == 'Better-than-random model':
          # GOOD: model output increases from 0 to 1 with noise
          y_score = np.array([p + np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Random model':
          # RANDOM: model output is purely random
          y_score = np.array([np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Worse-than-random model':
          # SUB RANDOM: model output decreases from 0 to -1 (worse than random)
          y_score = np.array([-p + np.random.randint(-1000, 1000)/1000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))

          # calculate ROC (fpr, tpr) points
          fpr, tpr, thresholds = roc_curve(y_true, y_score)
          # calculate recall, precision, and accuracy for corresponding thresholds
          # recall = TP / P
          recall = np.array([np.sum(y_true[y_score > t])/P
          for t in thresholds]).reshape((-1, 1))
          # precision = TP / (TP + FP)
          precision = np.array([np.sum(y_true[y_score > t])/np.count_nonzero(y_score > t)
          for t in thresholds]).reshape((-1, 1))
          # accuracy = (TP + TN) / (P + N)
          accuracy = np.array([(np.sum(y_true[y_score > t]) + np.sum(1 - y_true[y_score < t]))
          /len(y_score)
          for t in thresholds]).reshape((-1, 1))

          # Sort performance measures from min tpr to max tpr
          index = np.argsort(tpr)
          tpr_sorted = tpr[index]
          recall_sorted = recall[index]
          precision_sorted = precision[index]
          accuracy_sorted = accuracy[index]

          # visualize
          fig, ax = pyplot.subplots(3, 1)
          fig.suptitle(title, fontsize=12)

          line = np.arange(0, len(thresholds))/len(thresholds)
          ax[0].plot(fpr, tpr, label='ROC', color='purple')
          ax[0].plot(line, line, '--', label='random', color='black')
          ax[0].set_xlabel('fpr')
          ax[0].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[1].plot(line, recall, label='recall', color='blue')
          ax[1].plot(line, precision, label='precision', color='red')
          ax[1].plot(line, accuracy, label='accuracy', color='black')
          ax[1].set_xlabel('1 - threshold')
          ax[1].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[2].plot(tpr_sorted, recall_sorted, label='recall', color='blue')
          ax[2].plot(tpr_sorted, precision_sorted, label='precision', color='red')
          ax[2].plot(tpr_sorted, accuracy_sorted, label='accuracy', color='black')
          ax[2].set_xlabel('tpr (1 - fnr)')
          ax[2].legend(loc='center left', bbox_to_anchor=(1, 0.5))

          fig.tight_layout()
          fig.subplots_adjust(top=0.88)
          pyplot.show()





          share|improve this answer











          $endgroup$












          • $begingroup$
            So when random phenomena completely rules, in practice it is observed that they generally have inverse relationship. There are different situations but, can we say generally if we increase precision it means that we predict negative examples more accurately and if we increase recall it means that we predict positive examples more accurately?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan First we need to define "more accurately" in terms of TN, TP, etc. For example "accuracy" is for both positives and negatives, i.e. (TP+TN / P+N) which I added it to the plots, it has a rise and a fall for better-than-random models.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            I mean ratio of correctly predicted labels to all labels for a specific class. Like TP / P or TN / N. If I increase precision does it predict negative examples more accurately with increasing TN / N?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan Aha. For better-than-random models, increase in precision means decrease in recall (and vice versa), which is decrease in TP/P (P = TP+FN). For TN/N, we know when threshold is increased (decrease in recall) both TP and FP decrease since we are selecting less positives, thus FP/N decreases, and 1 - FP/N = TN/N increases. So the answer to your question is yes.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            It's good. Finally If I define TP / P as positive recall and TN / N as negative recall then I suppose with increasing precision I increase negative recall and with increasing recall because it is same thing I also increase positive recall. So it looks like matter of increasing negative or positive recall and which one more important to me.
            $endgroup$
            – Tolga Karahan
            yesterday


















          3












          $begingroup$

          Thanks for clear statement of the problem. The point is that if you want to decrease false negatives, you should sufficiently lower the threshold of your decision function. If the false negatives are decreased, as you mentioned, true positives increase but false positives can also increase. As a result, recall will increase and precision will decrease.






          share|improve this answer











          $endgroup$








          • 1




            $begingroup$
            I've just learned this topic and It seems I focused equations to much with neglecting effects of changing model. This explanation helped to clarify things. Thank you.
            $endgroup$
            – Tolga Karahan
            2 days ago











          • $begingroup$
            @TolgaKarahanYou're welcome. I am very pleased my answer helped.
            $endgroup$
            – pythinker
            2 days ago










          • $begingroup$
            This is incorrect. See my answer.
            $endgroup$
            – kbrose
            2 days ago


















          2












          $begingroup$

          You are correct @Tolga, both can increase at the same time. Consider the following data:



          Prediction | True Class
          1.0 | 0
          0.5 | 1
          0.0 | 0


          If you set your cut off point as 0.75, then you have



          $$ Precision = fracTPTP + FP = frac00 + 1 = 0 $$
          $$ Recall = fracTPTP + FN = frac00 + 1 = 0$$



          then if you decrease your cut off point to 0.25, you have



          $$ Precision = fracTPTP + FP = frac11 + 1 = 0.5 $$
          $$ Recall = fracTPTP + FN = frac11 + 0 = 1$$



          and so you can see, both precision and recall increased when we decreased the number of False Negatives.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you. It seems distribution of data is so important and it isn't surprising of course.
            $endgroup$
            – Tolga Karahan
            2 days ago










          • $begingroup$
            But you still need to be realistic. It is unlikely you can decrease the number of False Negatives without increasing the number of False Positives.
            $endgroup$
            – Pedro Henrique Monforte
            2 days ago










          • $begingroup$
            You provide no data and no argument to back up your claim. I provide an example showing exactly why the OP's statement is correct. And I'm the one that needs to be realistic. Really?
            $endgroup$
            – kbrose
            yesterday











          Your Answer








          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "557"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );






          Tolga Karahan is a new contributor. Be nice, and check out our Code of Conduct.









          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f49117%2finverse-relationship-between-precision-and-recall%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          3 Answers
          3






          active

          oldest

          votes








          3 Answers
          3






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          2












          $begingroup$

          If we decrease the false negative (select more positives), recall always increases, but precision may increase or decrease. Generally, for models better than random, precision and recall have an inverse relationship (@pythinker's answer), but for models worse than random, they have a direct relationship (@kbrose's example).



          It is worth noting that we can artificially build a sample that causes a model which is better-than-random on true distribution to perform worse-than-random, so we are assuming that the sample resembles the true distribution.



          Recall



          We have
          $$TP = P - FN$$
          therefore, recall would be
          $$r = fracP-FNP = 1- fracFNP$$
          which always increases by decrease in $FN$.



          Precision



          For precision, the relation is not as straightforward. Lets start with two examples.



          First case: decrease in precision, by decrease in false negative:



          label model prediction
          1 0.8
          0 0.2
          0 0.2
          1 0.2


          For threshold $0.5$ (false negative = $(1, 0.2)$),



          $$p = frac11+0=1$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac22+2=0.5$$



          Second case: increase in precision, by decrease in false negative (the same as @kbrose example):



          label model prediction
          0 1.0
          1 0.4
          0 0.1


          For threshold $0.5$ (false negative = $(1, 0.4)$),



          $$p = frac00+1=0$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac11+2=0.33$$



          It is worth noting that ROC curve for this case is





          Analysis of precision based on ROC curve



          When we lower the threshold, false negative decreases, and true positive [rate] increases, which is equivalent to moving to the right in ROC plot. I did a simulation for better-than-random, random, and worse-than-random models, and plotted ROC, recall, and precision:









          As you can see, by moving to the right, for better-than-random model, precision decreases, for random model, precision has substantial fluctuations, and for worse-than-random model precision increases. And there are slight fluctuations in all three cases. Therefore,




          By increase in recall, if model is better than random, precision generally decreases. If mode is worse than random, precision generally increases.




          Here is the code for simulation:



          import numpy as np
          from sklearn.metrics import roc_curve
          from matplotlib import pyplot

          np.random.seed(123)
          count = 2000
          P = int(count * 0.5)
          N = count - P
          # first half zero, second half one
          y_true = np.concatenate((np.zeros((N, 1)), np.ones((P, 1))))

          title = 'Better-than-random model'
          # title = 'Random model'
          # title = 'Worse-than-random model'
          if title == 'Better-than-random model':
          # GOOD: model output increases from 0 to 1 with noise
          y_score = np.array([p + np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Random model':
          # RANDOM: model output is purely random
          y_score = np.array([np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Worse-than-random model':
          # SUB RANDOM: model output decreases from 0 to -1 (worse than random)
          y_score = np.array([-p + np.random.randint(-1000, 1000)/1000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))

          # calculate ROC (fpr, tpr) points
          fpr, tpr, thresholds = roc_curve(y_true, y_score)
          # calculate recall, precision, and accuracy for corresponding thresholds
          # recall = TP / P
          recall = np.array([np.sum(y_true[y_score > t])/P
          for t in thresholds]).reshape((-1, 1))
          # precision = TP / (TP + FP)
          precision = np.array([np.sum(y_true[y_score > t])/np.count_nonzero(y_score > t)
          for t in thresholds]).reshape((-1, 1))
          # accuracy = (TP + TN) / (P + N)
          accuracy = np.array([(np.sum(y_true[y_score > t]) + np.sum(1 - y_true[y_score < t]))
          /len(y_score)
          for t in thresholds]).reshape((-1, 1))

          # Sort performance measures from min tpr to max tpr
          index = np.argsort(tpr)
          tpr_sorted = tpr[index]
          recall_sorted = recall[index]
          precision_sorted = precision[index]
          accuracy_sorted = accuracy[index]

          # visualize
          fig, ax = pyplot.subplots(3, 1)
          fig.suptitle(title, fontsize=12)

          line = np.arange(0, len(thresholds))/len(thresholds)
          ax[0].plot(fpr, tpr, label='ROC', color='purple')
          ax[0].plot(line, line, '--', label='random', color='black')
          ax[0].set_xlabel('fpr')
          ax[0].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[1].plot(line, recall, label='recall', color='blue')
          ax[1].plot(line, precision, label='precision', color='red')
          ax[1].plot(line, accuracy, label='accuracy', color='black')
          ax[1].set_xlabel('1 - threshold')
          ax[1].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[2].plot(tpr_sorted, recall_sorted, label='recall', color='blue')
          ax[2].plot(tpr_sorted, precision_sorted, label='precision', color='red')
          ax[2].plot(tpr_sorted, accuracy_sorted, label='accuracy', color='black')
          ax[2].set_xlabel('tpr (1 - fnr)')
          ax[2].legend(loc='center left', bbox_to_anchor=(1, 0.5))

          fig.tight_layout()
          fig.subplots_adjust(top=0.88)
          pyplot.show()





          share|improve this answer











          $endgroup$












          • $begingroup$
            So when random phenomena completely rules, in practice it is observed that they generally have inverse relationship. There are different situations but, can we say generally if we increase precision it means that we predict negative examples more accurately and if we increase recall it means that we predict positive examples more accurately?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan First we need to define "more accurately" in terms of TN, TP, etc. For example "accuracy" is for both positives and negatives, i.e. (TP+TN / P+N) which I added it to the plots, it has a rise and a fall for better-than-random models.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            I mean ratio of correctly predicted labels to all labels for a specific class. Like TP / P or TN / N. If I increase precision does it predict negative examples more accurately with increasing TN / N?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan Aha. For better-than-random models, increase in precision means decrease in recall (and vice versa), which is decrease in TP/P (P = TP+FN). For TN/N, we know when threshold is increased (decrease in recall) both TP and FP decrease since we are selecting less positives, thus FP/N decreases, and 1 - FP/N = TN/N increases. So the answer to your question is yes.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            It's good. Finally If I define TP / P as positive recall and TN / N as negative recall then I suppose with increasing precision I increase negative recall and with increasing recall because it is same thing I also increase positive recall. So it looks like matter of increasing negative or positive recall and which one more important to me.
            $endgroup$
            – Tolga Karahan
            yesterday















          2












          $begingroup$

          If we decrease the false negative (select more positives), recall always increases, but precision may increase or decrease. Generally, for models better than random, precision and recall have an inverse relationship (@pythinker's answer), but for models worse than random, they have a direct relationship (@kbrose's example).



          It is worth noting that we can artificially build a sample that causes a model which is better-than-random on true distribution to perform worse-than-random, so we are assuming that the sample resembles the true distribution.



          Recall



          We have
          $$TP = P - FN$$
          therefore, recall would be
          $$r = fracP-FNP = 1- fracFNP$$
          which always increases by decrease in $FN$.



          Precision



          For precision, the relation is not as straightforward. Lets start with two examples.



          First case: decrease in precision, by decrease in false negative:



          label model prediction
          1 0.8
          0 0.2
          0 0.2
          1 0.2


          For threshold $0.5$ (false negative = $(1, 0.2)$),



          $$p = frac11+0=1$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac22+2=0.5$$



          Second case: increase in precision, by decrease in false negative (the same as @kbrose example):



          label model prediction
          0 1.0
          1 0.4
          0 0.1


          For threshold $0.5$ (false negative = $(1, 0.4)$),



          $$p = frac00+1=0$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac11+2=0.33$$



          It is worth noting that ROC curve for this case is





          Analysis of precision based on ROC curve



          When we lower the threshold, false negative decreases, and true positive [rate] increases, which is equivalent to moving to the right in ROC plot. I did a simulation for better-than-random, random, and worse-than-random models, and plotted ROC, recall, and precision:









          As you can see, by moving to the right, for better-than-random model, precision decreases, for random model, precision has substantial fluctuations, and for worse-than-random model precision increases. And there are slight fluctuations in all three cases. Therefore,




          By increase in recall, if model is better than random, precision generally decreases. If mode is worse than random, precision generally increases.




          Here is the code for simulation:



          import numpy as np
          from sklearn.metrics import roc_curve
          from matplotlib import pyplot

          np.random.seed(123)
          count = 2000
          P = int(count * 0.5)
          N = count - P
          # first half zero, second half one
          y_true = np.concatenate((np.zeros((N, 1)), np.ones((P, 1))))

          title = 'Better-than-random model'
          # title = 'Random model'
          # title = 'Worse-than-random model'
          if title == 'Better-than-random model':
          # GOOD: model output increases from 0 to 1 with noise
          y_score = np.array([p + np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Random model':
          # RANDOM: model output is purely random
          y_score = np.array([np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Worse-than-random model':
          # SUB RANDOM: model output decreases from 0 to -1 (worse than random)
          y_score = np.array([-p + np.random.randint(-1000, 1000)/1000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))

          # calculate ROC (fpr, tpr) points
          fpr, tpr, thresholds = roc_curve(y_true, y_score)
          # calculate recall, precision, and accuracy for corresponding thresholds
          # recall = TP / P
          recall = np.array([np.sum(y_true[y_score > t])/P
          for t in thresholds]).reshape((-1, 1))
          # precision = TP / (TP + FP)
          precision = np.array([np.sum(y_true[y_score > t])/np.count_nonzero(y_score > t)
          for t in thresholds]).reshape((-1, 1))
          # accuracy = (TP + TN) / (P + N)
          accuracy = np.array([(np.sum(y_true[y_score > t]) + np.sum(1 - y_true[y_score < t]))
          /len(y_score)
          for t in thresholds]).reshape((-1, 1))

          # Sort performance measures from min tpr to max tpr
          index = np.argsort(tpr)
          tpr_sorted = tpr[index]
          recall_sorted = recall[index]
          precision_sorted = precision[index]
          accuracy_sorted = accuracy[index]

          # visualize
          fig, ax = pyplot.subplots(3, 1)
          fig.suptitle(title, fontsize=12)

          line = np.arange(0, len(thresholds))/len(thresholds)
          ax[0].plot(fpr, tpr, label='ROC', color='purple')
          ax[0].plot(line, line, '--', label='random', color='black')
          ax[0].set_xlabel('fpr')
          ax[0].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[1].plot(line, recall, label='recall', color='blue')
          ax[1].plot(line, precision, label='precision', color='red')
          ax[1].plot(line, accuracy, label='accuracy', color='black')
          ax[1].set_xlabel('1 - threshold')
          ax[1].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[2].plot(tpr_sorted, recall_sorted, label='recall', color='blue')
          ax[2].plot(tpr_sorted, precision_sorted, label='precision', color='red')
          ax[2].plot(tpr_sorted, accuracy_sorted, label='accuracy', color='black')
          ax[2].set_xlabel('tpr (1 - fnr)')
          ax[2].legend(loc='center left', bbox_to_anchor=(1, 0.5))

          fig.tight_layout()
          fig.subplots_adjust(top=0.88)
          pyplot.show()





          share|improve this answer











          $endgroup$












          • $begingroup$
            So when random phenomena completely rules, in practice it is observed that they generally have inverse relationship. There are different situations but, can we say generally if we increase precision it means that we predict negative examples more accurately and if we increase recall it means that we predict positive examples more accurately?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan First we need to define "more accurately" in terms of TN, TP, etc. For example "accuracy" is for both positives and negatives, i.e. (TP+TN / P+N) which I added it to the plots, it has a rise and a fall for better-than-random models.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            I mean ratio of correctly predicted labels to all labels for a specific class. Like TP / P or TN / N. If I increase precision does it predict negative examples more accurately with increasing TN / N?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan Aha. For better-than-random models, increase in precision means decrease in recall (and vice versa), which is decrease in TP/P (P = TP+FN). For TN/N, we know when threshold is increased (decrease in recall) both TP and FP decrease since we are selecting less positives, thus FP/N decreases, and 1 - FP/N = TN/N increases. So the answer to your question is yes.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            It's good. Finally If I define TP / P as positive recall and TN / N as negative recall then I suppose with increasing precision I increase negative recall and with increasing recall because it is same thing I also increase positive recall. So it looks like matter of increasing negative or positive recall and which one more important to me.
            $endgroup$
            – Tolga Karahan
            yesterday













          2












          2








          2





          $begingroup$

          If we decrease the false negative (select more positives), recall always increases, but precision may increase or decrease. Generally, for models better than random, precision and recall have an inverse relationship (@pythinker's answer), but for models worse than random, they have a direct relationship (@kbrose's example).



          It is worth noting that we can artificially build a sample that causes a model which is better-than-random on true distribution to perform worse-than-random, so we are assuming that the sample resembles the true distribution.



          Recall



          We have
          $$TP = P - FN$$
          therefore, recall would be
          $$r = fracP-FNP = 1- fracFNP$$
          which always increases by decrease in $FN$.



          Precision



          For precision, the relation is not as straightforward. Lets start with two examples.



          First case: decrease in precision, by decrease in false negative:



          label model prediction
          1 0.8
          0 0.2
          0 0.2
          1 0.2


          For threshold $0.5$ (false negative = $(1, 0.2)$),



          $$p = frac11+0=1$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac22+2=0.5$$



          Second case: increase in precision, by decrease in false negative (the same as @kbrose example):



          label model prediction
          0 1.0
          1 0.4
          0 0.1


          For threshold $0.5$ (false negative = $(1, 0.4)$),



          $$p = frac00+1=0$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac11+2=0.33$$



          It is worth noting that ROC curve for this case is





          Analysis of precision based on ROC curve



          When we lower the threshold, false negative decreases, and true positive [rate] increases, which is equivalent to moving to the right in ROC plot. I did a simulation for better-than-random, random, and worse-than-random models, and plotted ROC, recall, and precision:









          As you can see, by moving to the right, for better-than-random model, precision decreases, for random model, precision has substantial fluctuations, and for worse-than-random model precision increases. And there are slight fluctuations in all three cases. Therefore,




          By increase in recall, if model is better than random, precision generally decreases. If mode is worse than random, precision generally increases.




          Here is the code for simulation:



          import numpy as np
          from sklearn.metrics import roc_curve
          from matplotlib import pyplot

          np.random.seed(123)
          count = 2000
          P = int(count * 0.5)
          N = count - P
          # first half zero, second half one
          y_true = np.concatenate((np.zeros((N, 1)), np.ones((P, 1))))

          title = 'Better-than-random model'
          # title = 'Random model'
          # title = 'Worse-than-random model'
          if title == 'Better-than-random model':
          # GOOD: model output increases from 0 to 1 with noise
          y_score = np.array([p + np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Random model':
          # RANDOM: model output is purely random
          y_score = np.array([np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Worse-than-random model':
          # SUB RANDOM: model output decreases from 0 to -1 (worse than random)
          y_score = np.array([-p + np.random.randint(-1000, 1000)/1000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))

          # calculate ROC (fpr, tpr) points
          fpr, tpr, thresholds = roc_curve(y_true, y_score)
          # calculate recall, precision, and accuracy for corresponding thresholds
          # recall = TP / P
          recall = np.array([np.sum(y_true[y_score > t])/P
          for t in thresholds]).reshape((-1, 1))
          # precision = TP / (TP + FP)
          precision = np.array([np.sum(y_true[y_score > t])/np.count_nonzero(y_score > t)
          for t in thresholds]).reshape((-1, 1))
          # accuracy = (TP + TN) / (P + N)
          accuracy = np.array([(np.sum(y_true[y_score > t]) + np.sum(1 - y_true[y_score < t]))
          /len(y_score)
          for t in thresholds]).reshape((-1, 1))

          # Sort performance measures from min tpr to max tpr
          index = np.argsort(tpr)
          tpr_sorted = tpr[index]
          recall_sorted = recall[index]
          precision_sorted = precision[index]
          accuracy_sorted = accuracy[index]

          # visualize
          fig, ax = pyplot.subplots(3, 1)
          fig.suptitle(title, fontsize=12)

          line = np.arange(0, len(thresholds))/len(thresholds)
          ax[0].plot(fpr, tpr, label='ROC', color='purple')
          ax[0].plot(line, line, '--', label='random', color='black')
          ax[0].set_xlabel('fpr')
          ax[0].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[1].plot(line, recall, label='recall', color='blue')
          ax[1].plot(line, precision, label='precision', color='red')
          ax[1].plot(line, accuracy, label='accuracy', color='black')
          ax[1].set_xlabel('1 - threshold')
          ax[1].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[2].plot(tpr_sorted, recall_sorted, label='recall', color='blue')
          ax[2].plot(tpr_sorted, precision_sorted, label='precision', color='red')
          ax[2].plot(tpr_sorted, accuracy_sorted, label='accuracy', color='black')
          ax[2].set_xlabel('tpr (1 - fnr)')
          ax[2].legend(loc='center left', bbox_to_anchor=(1, 0.5))

          fig.tight_layout()
          fig.subplots_adjust(top=0.88)
          pyplot.show()





          share|improve this answer











          $endgroup$



          If we decrease the false negative (select more positives), recall always increases, but precision may increase or decrease. Generally, for models better than random, precision and recall have an inverse relationship (@pythinker's answer), but for models worse than random, they have a direct relationship (@kbrose's example).



          It is worth noting that we can artificially build a sample that causes a model which is better-than-random on true distribution to perform worse-than-random, so we are assuming that the sample resembles the true distribution.



          Recall



          We have
          $$TP = P - FN$$
          therefore, recall would be
          $$r = fracP-FNP = 1- fracFNP$$
          which always increases by decrease in $FN$.



          Precision



          For precision, the relation is not as straightforward. Lets start with two examples.



          First case: decrease in precision, by decrease in false negative:



          label model prediction
          1 0.8
          0 0.2
          0 0.2
          1 0.2


          For threshold $0.5$ (false negative = $(1, 0.2)$),



          $$p = frac11+0=1$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac22+2=0.5$$



          Second case: increase in precision, by decrease in false negative (the same as @kbrose example):



          label model prediction
          0 1.0
          1 0.4
          0 0.1


          For threshold $0.5$ (false negative = $(1, 0.4)$),



          $$p = frac00+1=0$$



          For threshold $0.0$ (false negative = $$),



          $$p = frac11+2=0.33$$



          It is worth noting that ROC curve for this case is





          Analysis of precision based on ROC curve



          When we lower the threshold, false negative decreases, and true positive [rate] increases, which is equivalent to moving to the right in ROC plot. I did a simulation for better-than-random, random, and worse-than-random models, and plotted ROC, recall, and precision:









          As you can see, by moving to the right, for better-than-random model, precision decreases, for random model, precision has substantial fluctuations, and for worse-than-random model precision increases. And there are slight fluctuations in all three cases. Therefore,




          By increase in recall, if model is better than random, precision generally decreases. If mode is worse than random, precision generally increases.




          Here is the code for simulation:



          import numpy as np
          from sklearn.metrics import roc_curve
          from matplotlib import pyplot

          np.random.seed(123)
          count = 2000
          P = int(count * 0.5)
          N = count - P
          # first half zero, second half one
          y_true = np.concatenate((np.zeros((N, 1)), np.ones((P, 1))))

          title = 'Better-than-random model'
          # title = 'Random model'
          # title = 'Worse-than-random model'
          if title == 'Better-than-random model':
          # GOOD: model output increases from 0 to 1 with noise
          y_score = np.array([p + np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Random model':
          # RANDOM: model output is purely random
          y_score = np.array([np.random.randint(-1000, 1000)/3000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))
          elif title == 'Worse-than-random model':
          # SUB RANDOM: model output decreases from 0 to -1 (worse than random)
          y_score = np.array([-p + np.random.randint(-1000, 1000)/1000
          for p in np.arange(0, 1, 1.0 / count)]).reshape((-1, 1))

          # calculate ROC (fpr, tpr) points
          fpr, tpr, thresholds = roc_curve(y_true, y_score)
          # calculate recall, precision, and accuracy for corresponding thresholds
          # recall = TP / P
          recall = np.array([np.sum(y_true[y_score > t])/P
          for t in thresholds]).reshape((-1, 1))
          # precision = TP / (TP + FP)
          precision = np.array([np.sum(y_true[y_score > t])/np.count_nonzero(y_score > t)
          for t in thresholds]).reshape((-1, 1))
          # accuracy = (TP + TN) / (P + N)
          accuracy = np.array([(np.sum(y_true[y_score > t]) + np.sum(1 - y_true[y_score < t]))
          /len(y_score)
          for t in thresholds]).reshape((-1, 1))

          # Sort performance measures from min tpr to max tpr
          index = np.argsort(tpr)
          tpr_sorted = tpr[index]
          recall_sorted = recall[index]
          precision_sorted = precision[index]
          accuracy_sorted = accuracy[index]

          # visualize
          fig, ax = pyplot.subplots(3, 1)
          fig.suptitle(title, fontsize=12)

          line = np.arange(0, len(thresholds))/len(thresholds)
          ax[0].plot(fpr, tpr, label='ROC', color='purple')
          ax[0].plot(line, line, '--', label='random', color='black')
          ax[0].set_xlabel('fpr')
          ax[0].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[1].plot(line, recall, label='recall', color='blue')
          ax[1].plot(line, precision, label='precision', color='red')
          ax[1].plot(line, accuracy, label='accuracy', color='black')
          ax[1].set_xlabel('1 - threshold')
          ax[1].legend(loc='center left', bbox_to_anchor=(1, 0.5))
          ax[2].plot(tpr_sorted, recall_sorted, label='recall', color='blue')
          ax[2].plot(tpr_sorted, precision_sorted, label='precision', color='red')
          ax[2].plot(tpr_sorted, accuracy_sorted, label='accuracy', color='black')
          ax[2].set_xlabel('tpr (1 - fnr)')
          ax[2].legend(loc='center left', bbox_to_anchor=(1, 0.5))

          fig.tight_layout()
          fig.subplots_adjust(top=0.88)
          pyplot.show()






          share|improve this answer














          share|improve this answer



          share|improve this answer








          edited yesterday

























          answered 2 days ago









          EsmailianEsmailian

          3,181320




          3,181320











          • $begingroup$
            So when random phenomena completely rules, in practice it is observed that they generally have inverse relationship. There are different situations but, can we say generally if we increase precision it means that we predict negative examples more accurately and if we increase recall it means that we predict positive examples more accurately?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan First we need to define "more accurately" in terms of TN, TP, etc. For example "accuracy" is for both positives and negatives, i.e. (TP+TN / P+N) which I added it to the plots, it has a rise and a fall for better-than-random models.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            I mean ratio of correctly predicted labels to all labels for a specific class. Like TP / P or TN / N. If I increase precision does it predict negative examples more accurately with increasing TN / N?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan Aha. For better-than-random models, increase in precision means decrease in recall (and vice versa), which is decrease in TP/P (P = TP+FN). For TN/N, we know when threshold is increased (decrease in recall) both TP and FP decrease since we are selecting less positives, thus FP/N decreases, and 1 - FP/N = TN/N increases. So the answer to your question is yes.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            It's good. Finally If I define TP / P as positive recall and TN / N as negative recall then I suppose with increasing precision I increase negative recall and with increasing recall because it is same thing I also increase positive recall. So it looks like matter of increasing negative or positive recall and which one more important to me.
            $endgroup$
            – Tolga Karahan
            yesterday
















          • $begingroup$
            So when random phenomena completely rules, in practice it is observed that they generally have inverse relationship. There are different situations but, can we say generally if we increase precision it means that we predict negative examples more accurately and if we increase recall it means that we predict positive examples more accurately?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan First we need to define "more accurately" in terms of TN, TP, etc. For example "accuracy" is for both positives and negatives, i.e. (TP+TN / P+N) which I added it to the plots, it has a rise and a fall for better-than-random models.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            I mean ratio of correctly predicted labels to all labels for a specific class. Like TP / P or TN / N. If I increase precision does it predict negative examples more accurately with increasing TN / N?
            $endgroup$
            – Tolga Karahan
            yesterday











          • $begingroup$
            @TolgaKarahan Aha. For better-than-random models, increase in precision means decrease in recall (and vice versa), which is decrease in TP/P (P = TP+FN). For TN/N, we know when threshold is increased (decrease in recall) both TP and FP decrease since we are selecting less positives, thus FP/N decreases, and 1 - FP/N = TN/N increases. So the answer to your question is yes.
            $endgroup$
            – Esmailian
            yesterday











          • $begingroup$
            It's good. Finally If I define TP / P as positive recall and TN / N as negative recall then I suppose with increasing precision I increase negative recall and with increasing recall because it is same thing I also increase positive recall. So it looks like matter of increasing negative or positive recall and which one more important to me.
            $endgroup$
            – Tolga Karahan
            yesterday















          $begingroup$
          So when random phenomena completely rules, in practice it is observed that they generally have inverse relationship. There are different situations but, can we say generally if we increase precision it means that we predict negative examples more accurately and if we increase recall it means that we predict positive examples more accurately?
          $endgroup$
          – Tolga Karahan
          yesterday





          $begingroup$
          So when random phenomena completely rules, in practice it is observed that they generally have inverse relationship. There are different situations but, can we say generally if we increase precision it means that we predict negative examples more accurately and if we increase recall it means that we predict positive examples more accurately?
          $endgroup$
          – Tolga Karahan
          yesterday













          $begingroup$
          @TolgaKarahan First we need to define "more accurately" in terms of TN, TP, etc. For example "accuracy" is for both positives and negatives, i.e. (TP+TN / P+N) which I added it to the plots, it has a rise and a fall for better-than-random models.
          $endgroup$
          – Esmailian
          yesterday





          $begingroup$
          @TolgaKarahan First we need to define "more accurately" in terms of TN, TP, etc. For example "accuracy" is for both positives and negatives, i.e. (TP+TN / P+N) which I added it to the plots, it has a rise and a fall for better-than-random models.
          $endgroup$
          – Esmailian
          yesterday













          $begingroup$
          I mean ratio of correctly predicted labels to all labels for a specific class. Like TP / P or TN / N. If I increase precision does it predict negative examples more accurately with increasing TN / N?
          $endgroup$
          – Tolga Karahan
          yesterday





          $begingroup$
          I mean ratio of correctly predicted labels to all labels for a specific class. Like TP / P or TN / N. If I increase precision does it predict negative examples more accurately with increasing TN / N?
          $endgroup$
          – Tolga Karahan
          yesterday













          $begingroup$
          @TolgaKarahan Aha. For better-than-random models, increase in precision means decrease in recall (and vice versa), which is decrease in TP/P (P = TP+FN). For TN/N, we know when threshold is increased (decrease in recall) both TP and FP decrease since we are selecting less positives, thus FP/N decreases, and 1 - FP/N = TN/N increases. So the answer to your question is yes.
          $endgroup$
          – Esmailian
          yesterday





          $begingroup$
          @TolgaKarahan Aha. For better-than-random models, increase in precision means decrease in recall (and vice versa), which is decrease in TP/P (P = TP+FN). For TN/N, we know when threshold is increased (decrease in recall) both TP and FP decrease since we are selecting less positives, thus FP/N decreases, and 1 - FP/N = TN/N increases. So the answer to your question is yes.
          $endgroup$
          – Esmailian
          yesterday













          $begingroup$
          It's good. Finally If I define TP / P as positive recall and TN / N as negative recall then I suppose with increasing precision I increase negative recall and with increasing recall because it is same thing I also increase positive recall. So it looks like matter of increasing negative or positive recall and which one more important to me.
          $endgroup$
          – Tolga Karahan
          yesterday




          $begingroup$
          It's good. Finally If I define TP / P as positive recall and TN / N as negative recall then I suppose with increasing precision I increase negative recall and with increasing recall because it is same thing I also increase positive recall. So it looks like matter of increasing negative or positive recall and which one more important to me.
          $endgroup$
          – Tolga Karahan
          yesterday











          3












          $begingroup$

          Thanks for clear statement of the problem. The point is that if you want to decrease false negatives, you should sufficiently lower the threshold of your decision function. If the false negatives are decreased, as you mentioned, true positives increase but false positives can also increase. As a result, recall will increase and precision will decrease.






          share|improve this answer











          $endgroup$








          • 1




            $begingroup$
            I've just learned this topic and It seems I focused equations to much with neglecting effects of changing model. This explanation helped to clarify things. Thank you.
            $endgroup$
            – Tolga Karahan
            2 days ago











          • $begingroup$
            @TolgaKarahanYou're welcome. I am very pleased my answer helped.
            $endgroup$
            – pythinker
            2 days ago










          • $begingroup$
            This is incorrect. See my answer.
            $endgroup$
            – kbrose
            2 days ago















          3












          $begingroup$

          Thanks for clear statement of the problem. The point is that if you want to decrease false negatives, you should sufficiently lower the threshold of your decision function. If the false negatives are decreased, as you mentioned, true positives increase but false positives can also increase. As a result, recall will increase and precision will decrease.






          share|improve this answer











          $endgroup$








          • 1




            $begingroup$
            I've just learned this topic and It seems I focused equations to much with neglecting effects of changing model. This explanation helped to clarify things. Thank you.
            $endgroup$
            – Tolga Karahan
            2 days ago











          • $begingroup$
            @TolgaKarahanYou're welcome. I am very pleased my answer helped.
            $endgroup$
            – pythinker
            2 days ago










          • $begingroup$
            This is incorrect. See my answer.
            $endgroup$
            – kbrose
            2 days ago













          3












          3








          3





          $begingroup$

          Thanks for clear statement of the problem. The point is that if you want to decrease false negatives, you should sufficiently lower the threshold of your decision function. If the false negatives are decreased, as you mentioned, true positives increase but false positives can also increase. As a result, recall will increase and precision will decrease.






          share|improve this answer











          $endgroup$



          Thanks for clear statement of the problem. The point is that if you want to decrease false negatives, you should sufficiently lower the threshold of your decision function. If the false negatives are decreased, as you mentioned, true positives increase but false positives can also increase. As a result, recall will increase and precision will decrease.







          share|improve this answer














          share|improve this answer



          share|improve this answer








          edited 2 days ago









          Pedro Henrique Monforte

          424112




          424112










          answered 2 days ago









          pythinkerpythinker

          8291213




          8291213







          • 1




            $begingroup$
            I've just learned this topic and It seems I focused equations to much with neglecting effects of changing model. This explanation helped to clarify things. Thank you.
            $endgroup$
            – Tolga Karahan
            2 days ago











          • $begingroup$
            @TolgaKarahanYou're welcome. I am very pleased my answer helped.
            $endgroup$
            – pythinker
            2 days ago










          • $begingroup$
            This is incorrect. See my answer.
            $endgroup$
            – kbrose
            2 days ago












          • 1




            $begingroup$
            I've just learned this topic and It seems I focused equations to much with neglecting effects of changing model. This explanation helped to clarify things. Thank you.
            $endgroup$
            – Tolga Karahan
            2 days ago











          • $begingroup$
            @TolgaKarahanYou're welcome. I am very pleased my answer helped.
            $endgroup$
            – pythinker
            2 days ago










          • $begingroup$
            This is incorrect. See my answer.
            $endgroup$
            – kbrose
            2 days ago







          1




          1




          $begingroup$
          I've just learned this topic and It seems I focused equations to much with neglecting effects of changing model. This explanation helped to clarify things. Thank you.
          $endgroup$
          – Tolga Karahan
          2 days ago





          $begingroup$
          I've just learned this topic and It seems I focused equations to much with neglecting effects of changing model. This explanation helped to clarify things. Thank you.
          $endgroup$
          – Tolga Karahan
          2 days ago













          $begingroup$
          @TolgaKarahanYou're welcome. I am very pleased my answer helped.
          $endgroup$
          – pythinker
          2 days ago




          $begingroup$
          @TolgaKarahanYou're welcome. I am very pleased my answer helped.
          $endgroup$
          – pythinker
          2 days ago












          $begingroup$
          This is incorrect. See my answer.
          $endgroup$
          – kbrose
          2 days ago




          $begingroup$
          This is incorrect. See my answer.
          $endgroup$
          – kbrose
          2 days ago











          2












          $begingroup$

          You are correct @Tolga, both can increase at the same time. Consider the following data:



          Prediction | True Class
          1.0 | 0
          0.5 | 1
          0.0 | 0


          If you set your cut off point as 0.75, then you have



          $$ Precision = fracTPTP + FP = frac00 + 1 = 0 $$
          $$ Recall = fracTPTP + FN = frac00 + 1 = 0$$



          then if you decrease your cut off point to 0.25, you have



          $$ Precision = fracTPTP + FP = frac11 + 1 = 0.5 $$
          $$ Recall = fracTPTP + FN = frac11 + 0 = 1$$



          and so you can see, both precision and recall increased when we decreased the number of False Negatives.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you. It seems distribution of data is so important and it isn't surprising of course.
            $endgroup$
            – Tolga Karahan
            2 days ago










          • $begingroup$
            But you still need to be realistic. It is unlikely you can decrease the number of False Negatives without increasing the number of False Positives.
            $endgroup$
            – Pedro Henrique Monforte
            2 days ago










          • $begingroup$
            You provide no data and no argument to back up your claim. I provide an example showing exactly why the OP's statement is correct. And I'm the one that needs to be realistic. Really?
            $endgroup$
            – kbrose
            yesterday















          2












          $begingroup$

          You are correct @Tolga, both can increase at the same time. Consider the following data:



          Prediction | True Class
          1.0 | 0
          0.5 | 1
          0.0 | 0


          If you set your cut off point as 0.75, then you have



          $$ Precision = fracTPTP + FP = frac00 + 1 = 0 $$
          $$ Recall = fracTPTP + FN = frac00 + 1 = 0$$



          then if you decrease your cut off point to 0.25, you have



          $$ Precision = fracTPTP + FP = frac11 + 1 = 0.5 $$
          $$ Recall = fracTPTP + FN = frac11 + 0 = 1$$



          and so you can see, both precision and recall increased when we decreased the number of False Negatives.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you. It seems distribution of data is so important and it isn't surprising of course.
            $endgroup$
            – Tolga Karahan
            2 days ago










          • $begingroup$
            But you still need to be realistic. It is unlikely you can decrease the number of False Negatives without increasing the number of False Positives.
            $endgroup$
            – Pedro Henrique Monforte
            2 days ago










          • $begingroup$
            You provide no data and no argument to back up your claim. I provide an example showing exactly why the OP's statement is correct. And I'm the one that needs to be realistic. Really?
            $endgroup$
            – kbrose
            yesterday













          2












          2








          2





          $begingroup$

          You are correct @Tolga, both can increase at the same time. Consider the following data:



          Prediction | True Class
          1.0 | 0
          0.5 | 1
          0.0 | 0


          If you set your cut off point as 0.75, then you have



          $$ Precision = fracTPTP + FP = frac00 + 1 = 0 $$
          $$ Recall = fracTPTP + FN = frac00 + 1 = 0$$



          then if you decrease your cut off point to 0.25, you have



          $$ Precision = fracTPTP + FP = frac11 + 1 = 0.5 $$
          $$ Recall = fracTPTP + FN = frac11 + 0 = 1$$



          and so you can see, both precision and recall increased when we decreased the number of False Negatives.






          share|improve this answer









          $endgroup$



          You are correct @Tolga, both can increase at the same time. Consider the following data:



          Prediction | True Class
          1.0 | 0
          0.5 | 1
          0.0 | 0


          If you set your cut off point as 0.75, then you have



          $$ Precision = fracTPTP + FP = frac00 + 1 = 0 $$
          $$ Recall = fracTPTP + FN = frac00 + 1 = 0$$



          then if you decrease your cut off point to 0.25, you have



          $$ Precision = fracTPTP + FP = frac11 + 1 = 0.5 $$
          $$ Recall = fracTPTP + FN = frac11 + 0 = 1$$



          and so you can see, both precision and recall increased when we decreased the number of False Negatives.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered 2 days ago









          kbrosekbrose

          1,053313




          1,053313











          • $begingroup$
            Thank you. It seems distribution of data is so important and it isn't surprising of course.
            $endgroup$
            – Tolga Karahan
            2 days ago










          • $begingroup$
            But you still need to be realistic. It is unlikely you can decrease the number of False Negatives without increasing the number of False Positives.
            $endgroup$
            – Pedro Henrique Monforte
            2 days ago










          • $begingroup$
            You provide no data and no argument to back up your claim. I provide an example showing exactly why the OP's statement is correct. And I'm the one that needs to be realistic. Really?
            $endgroup$
            – kbrose
            yesterday
















          • $begingroup$
            Thank you. It seems distribution of data is so important and it isn't surprising of course.
            $endgroup$
            – Tolga Karahan
            2 days ago










          • $begingroup$
            But you still need to be realistic. It is unlikely you can decrease the number of False Negatives without increasing the number of False Positives.
            $endgroup$
            – Pedro Henrique Monforte
            2 days ago










          • $begingroup$
            You provide no data and no argument to back up your claim. I provide an example showing exactly why the OP's statement is correct. And I'm the one that needs to be realistic. Really?
            $endgroup$
            – kbrose
            yesterday















          $begingroup$
          Thank you. It seems distribution of data is so important and it isn't surprising of course.
          $endgroup$
          – Tolga Karahan
          2 days ago




          $begingroup$
          Thank you. It seems distribution of data is so important and it isn't surprising of course.
          $endgroup$
          – Tolga Karahan
          2 days ago












          $begingroup$
          But you still need to be realistic. It is unlikely you can decrease the number of False Negatives without increasing the number of False Positives.
          $endgroup$
          – Pedro Henrique Monforte
          2 days ago




          $begingroup$
          But you still need to be realistic. It is unlikely you can decrease the number of False Negatives without increasing the number of False Positives.
          $endgroup$
          – Pedro Henrique Monforte
          2 days ago












          $begingroup$
          You provide no data and no argument to back up your claim. I provide an example showing exactly why the OP's statement is correct. And I'm the one that needs to be realistic. Really?
          $endgroup$
          – kbrose
          yesterday




          $begingroup$
          You provide no data and no argument to back up your claim. I provide an example showing exactly why the OP's statement is correct. And I'm the one that needs to be realistic. Really?
          $endgroup$
          – kbrose
          yesterday










          Tolga Karahan is a new contributor. Be nice, and check out our Code of Conduct.









          draft saved

          draft discarded


















          Tolga Karahan is a new contributor. Be nice, and check out our Code of Conduct.












          Tolga Karahan is a new contributor. Be nice, and check out our Code of Conduct.











          Tolga Karahan is a new contributor. Be nice, and check out our Code of Conduct.














          Thanks for contributing an answer to Data Science Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f49117%2finverse-relationship-between-precision-and-recall%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Grendel Contents Story Scholarship Depictions Notes References Navigation menu10.1093/notesj/gjn112Berserkeree

          Area configuration aggregation error after install Porto themeMagento 2.1 CE Installed but front/backend not loading/workingCSS not loading on page within Magento 2 pageCannot install module in Magento 2no commands defined in the “setup” namespace. in Magento2Magento 2: Static files are present but shows 404Why do i have to always run the commands to clean cache in Magento 2.1.8?Failure reason: 'Unable to unserialize value.'Error 500 after magento migrationIn production mode the site does not loadMagento 2 : Error 500 after installing

          Middle Expansion Olielle Resaix Definition: Uttering songs of triumph shouting with joy triumphant exulting Sejunction Journal 붙다 달 고급 품목 외출 The stretch trades the screeching tin. Definition: The act of speaking with a drawl a drawl Cough Sand Definition: An uproar a quarrel a noisy outbreak Shake Iron Publicize Horse House Baby 사과 Resaix Flaggy Jelly Temporary Unequaled Puppet A drop in the bucket Shrew 성격 회원 성질 미팅 The burn frames the tacky quality. Materialistic The smoke reduces the way. Yammoe Nondescript Cheek 얼굴 배 약하다 날리다 타다 The illegal country shows the iron. Help Rule Drearien Smoke Teaching Meaty Wasp Abraham Lincoln Jaws 진심 수리하다 Size Cork Idea Convert Think Lark John Lennon 거울 청소 군 추천하다 아이스크림