Is it good to plot a scatterplot with weird regression line? [closed]How should outliers be dealt with in linear regression analysis?Weird residuals in linear regressionRemoving outliers from newspaper content analysisBest way to display data with outliers?Outlier detection/imputation - discussionOutliers identificationFinding outliers on a scatter plotProblem using funnel plot to detect outliersFitting time series with outliersTreatment of outliers in financial data
Why do most airliners have underwing engines, while business jets have rear-mounted engines?
Is conquering your neighbors to fight a greater enemy a valid strategy?
How did Captain Marvel do this without dying?
Do I need transit visa for Dublin?
Why do Klingons use cloaking devices?
Is this standard Japanese employment negotiations, or am I missing something?
Why do we need a bootloader separate from our application program in microcontrollers?
Why do people prefer metropolitan areas, considering monsters and villains?
Initializing variables in an "if" statement
comparing two addresses
How to reclaim personal item I've lent to the office without burning bridges?
An easy way to solve this limit of a sum?
Do grungs have a written language?
How to play a D major chord lower than the open E major chord on guitar?
Does the Milky Way orbit around anything?
When moving a unique_ptr into a lambda, why is it not possible to call reset?
Groups where no elements commute except for the trivial cases
Why did Super-VGA offer the 5:4 1280*1024 resolution?
Can a USB hub be used to access a drive from two devices?
Why does this function pointer assignment work when assigned directly but not with the conditional operator?
Chilling juice in copper vessel
Are host configurations in the SSH config merged?
Can you create a free-floating MASYU puzzle?
Why does "sattsehen" take accusative "mich", not dative "mir"? Even though it is not "me" that I'm looking at?
Is it good to plot a scatterplot with weird regression line? [closed]
How should outliers be dealt with in linear regression analysis?Weird residuals in linear regressionRemoving outliers from newspaper content analysisBest way to display data with outliers?Outlier detection/imputation - discussionOutliers identificationFinding outliers on a scatter plotProblem using funnel plot to detect outliersFitting time series with outliersTreatment of outliers in financial data
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;
$begingroup$
Am a beginner in data analysis and i feel that something about these graphs are wrong. Am not sure that the outliers are the issue or am doing this the wrong way. Thanks for the help
regression outliers scatterplot
$endgroup$
closed as unclear what you're asking by Jake Westfall, user158565, Alexis, Michael Chernick, mkt Jun 28 at 20:53
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
add a comment |
$begingroup$
Am a beginner in data analysis and i feel that something about these graphs are wrong. Am not sure that the outliers are the issue or am doing this the wrong way. Thanks for the help
regression outliers scatterplot
$endgroup$
closed as unclear what you're asking by Jake Westfall, user158565, Alexis, Michael Chernick, mkt Jun 28 at 20:53
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
1
$begingroup$
Can you provide some more information - 1. What is your outcome variable? (what are its units?) 2. Which model did you use? 3. Have you performed any model diagnostics?
$endgroup$
– NB12
Jun 27 at 6:28
$begingroup$
This is just plotting area of property to predict the price of that property. The 4 different graphs represents 4 different area plotted against the price of property, which are LivingArea, BasementArea, 1stFloorArea, and GarageArea respectively. I have not build any model around it yet.
$endgroup$
– Aaron
Jun 27 at 6:52
2
$begingroup$
Are you concerned about the changing spread of data about the line or something else? Note that when expected house price is 0 you'd expect the variance to also be 0 - so changing spread with changing expected price is hardly a surprise. I'd probably lean toward some form of gamma model as a first thought.
$endgroup$
– Glen_b♦
Jun 27 at 7:52
$begingroup$
unrelated but if you are trying to predict housing price data for any commercial reason you may want to read money.stackexchange.com/questions/110537/… from yesterday
$endgroup$
– MD-Tech
Jun 27 at 14:17
add a comment |
$begingroup$
Am a beginner in data analysis and i feel that something about these graphs are wrong. Am not sure that the outliers are the issue or am doing this the wrong way. Thanks for the help
regression outliers scatterplot
$endgroup$
Am a beginner in data analysis and i feel that something about these graphs are wrong. Am not sure that the outliers are the issue or am doing this the wrong way. Thanks for the help
regression outliers scatterplot
regression outliers scatterplot
asked Jun 27 at 4:34
AaronAaron
235 bronze badges
235 bronze badges
closed as unclear what you're asking by Jake Westfall, user158565, Alexis, Michael Chernick, mkt Jun 28 at 20:53
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
closed as unclear what you're asking by Jake Westfall, user158565, Alexis, Michael Chernick, mkt Jun 28 at 20:53
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
1
$begingroup$
Can you provide some more information - 1. What is your outcome variable? (what are its units?) 2. Which model did you use? 3. Have you performed any model diagnostics?
$endgroup$
– NB12
Jun 27 at 6:28
$begingroup$
This is just plotting area of property to predict the price of that property. The 4 different graphs represents 4 different area plotted against the price of property, which are LivingArea, BasementArea, 1stFloorArea, and GarageArea respectively. I have not build any model around it yet.
$endgroup$
– Aaron
Jun 27 at 6:52
2
$begingroup$
Are you concerned about the changing spread of data about the line or something else? Note that when expected house price is 0 you'd expect the variance to also be 0 - so changing spread with changing expected price is hardly a surprise. I'd probably lean toward some form of gamma model as a first thought.
$endgroup$
– Glen_b♦
Jun 27 at 7:52
$begingroup$
unrelated but if you are trying to predict housing price data for any commercial reason you may want to read money.stackexchange.com/questions/110537/… from yesterday
$endgroup$
– MD-Tech
Jun 27 at 14:17
add a comment |
1
$begingroup$
Can you provide some more information - 1. What is your outcome variable? (what are its units?) 2. Which model did you use? 3. Have you performed any model diagnostics?
$endgroup$
– NB12
Jun 27 at 6:28
$begingroup$
This is just plotting area of property to predict the price of that property. The 4 different graphs represents 4 different area plotted against the price of property, which are LivingArea, BasementArea, 1stFloorArea, and GarageArea respectively. I have not build any model around it yet.
$endgroup$
– Aaron
Jun 27 at 6:52
2
$begingroup$
Are you concerned about the changing spread of data about the line or something else? Note that when expected house price is 0 you'd expect the variance to also be 0 - so changing spread with changing expected price is hardly a surprise. I'd probably lean toward some form of gamma model as a first thought.
$endgroup$
– Glen_b♦
Jun 27 at 7:52
$begingroup$
unrelated but if you are trying to predict housing price data for any commercial reason you may want to read money.stackexchange.com/questions/110537/… from yesterday
$endgroup$
– MD-Tech
Jun 27 at 14:17
1
1
$begingroup$
Can you provide some more information - 1. What is your outcome variable? (what are its units?) 2. Which model did you use? 3. Have you performed any model diagnostics?
$endgroup$
– NB12
Jun 27 at 6:28
$begingroup$
Can you provide some more information - 1. What is your outcome variable? (what are its units?) 2. Which model did you use? 3. Have you performed any model diagnostics?
$endgroup$
– NB12
Jun 27 at 6:28
$begingroup$
This is just plotting area of property to predict the price of that property. The 4 different graphs represents 4 different area plotted against the price of property, which are LivingArea, BasementArea, 1stFloorArea, and GarageArea respectively. I have not build any model around it yet.
$endgroup$
– Aaron
Jun 27 at 6:52
$begingroup$
This is just plotting area of property to predict the price of that property. The 4 different graphs represents 4 different area plotted against the price of property, which are LivingArea, BasementArea, 1stFloorArea, and GarageArea respectively. I have not build any model around it yet.
$endgroup$
– Aaron
Jun 27 at 6:52
2
2
$begingroup$
Are you concerned about the changing spread of data about the line or something else? Note that when expected house price is 0 you'd expect the variance to also be 0 - so changing spread with changing expected price is hardly a surprise. I'd probably lean toward some form of gamma model as a first thought.
$endgroup$
– Glen_b♦
Jun 27 at 7:52
$begingroup$
Are you concerned about the changing spread of data about the line or something else? Note that when expected house price is 0 you'd expect the variance to also be 0 - so changing spread with changing expected price is hardly a surprise. I'd probably lean toward some form of gamma model as a first thought.
$endgroup$
– Glen_b♦
Jun 27 at 7:52
$begingroup$
unrelated but if you are trying to predict housing price data for any commercial reason you may want to read money.stackexchange.com/questions/110537/… from yesterday
$endgroup$
– MD-Tech
Jun 27 at 14:17
$begingroup$
unrelated but if you are trying to predict housing price data for any commercial reason you may want to read money.stackexchange.com/questions/110537/… from yesterday
$endgroup$
– MD-Tech
Jun 27 at 14:17
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
It doesn't seem wrong. The linear fit passes through the data reasonably well. If you're concerned about the spread around the fitted line in the region with less data, it actually reflects the uncertainty in the model around different domains; which is typically calculated via Bayesian Linear Regression.
$endgroup$
4
$begingroup$
I wonder how relevant, valid and/or correct the point about Bayesian linear regression is.
$endgroup$
– Richard Hardy
Jun 27 at 13:07
1
$begingroup$
@RichardHardy have you done some research to you find out if there's an issue?
$endgroup$
– David
Jun 27 at 13:48
2
$begingroup$
@David, ideally, I would have. However, given the limited resources, I chose not to. There is also the question of who should bring the evidence: one that is claiming a fact without supporting evidence or one that does not quite believe it? I would tilt towards the former. Based on my current state of knowledge, I have some doubts that I have politely expressed in the form of "I wonder", just to give the author a heads up. Perhaps the statement was not thought through and the author would change it. Or if it is correct, then it would be reassuring to at least get a brief confirmation.
$endgroup$
– Richard Hardy
Jun 27 at 14:10
add a comment |
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
It doesn't seem wrong. The linear fit passes through the data reasonably well. If you're concerned about the spread around the fitted line in the region with less data, it actually reflects the uncertainty in the model around different domains; which is typically calculated via Bayesian Linear Regression.
$endgroup$
4
$begingroup$
I wonder how relevant, valid and/or correct the point about Bayesian linear regression is.
$endgroup$
– Richard Hardy
Jun 27 at 13:07
1
$begingroup$
@RichardHardy have you done some research to you find out if there's an issue?
$endgroup$
– David
Jun 27 at 13:48
2
$begingroup$
@David, ideally, I would have. However, given the limited resources, I chose not to. There is also the question of who should bring the evidence: one that is claiming a fact without supporting evidence or one that does not quite believe it? I would tilt towards the former. Based on my current state of knowledge, I have some doubts that I have politely expressed in the form of "I wonder", just to give the author a heads up. Perhaps the statement was not thought through and the author would change it. Or if it is correct, then it would be reassuring to at least get a brief confirmation.
$endgroup$
– Richard Hardy
Jun 27 at 14:10
add a comment |
$begingroup$
It doesn't seem wrong. The linear fit passes through the data reasonably well. If you're concerned about the spread around the fitted line in the region with less data, it actually reflects the uncertainty in the model around different domains; which is typically calculated via Bayesian Linear Regression.
$endgroup$
4
$begingroup$
I wonder how relevant, valid and/or correct the point about Bayesian linear regression is.
$endgroup$
– Richard Hardy
Jun 27 at 13:07
1
$begingroup$
@RichardHardy have you done some research to you find out if there's an issue?
$endgroup$
– David
Jun 27 at 13:48
2
$begingroup$
@David, ideally, I would have. However, given the limited resources, I chose not to. There is also the question of who should bring the evidence: one that is claiming a fact without supporting evidence or one that does not quite believe it? I would tilt towards the former. Based on my current state of knowledge, I have some doubts that I have politely expressed in the form of "I wonder", just to give the author a heads up. Perhaps the statement was not thought through and the author would change it. Or if it is correct, then it would be reassuring to at least get a brief confirmation.
$endgroup$
– Richard Hardy
Jun 27 at 14:10
add a comment |
$begingroup$
It doesn't seem wrong. The linear fit passes through the data reasonably well. If you're concerned about the spread around the fitted line in the region with less data, it actually reflects the uncertainty in the model around different domains; which is typically calculated via Bayesian Linear Regression.
$endgroup$
It doesn't seem wrong. The linear fit passes through the data reasonably well. If you're concerned about the spread around the fitted line in the region with less data, it actually reflects the uncertainty in the model around different domains; which is typically calculated via Bayesian Linear Regression.
answered Jun 27 at 4:43
gunesgunes
11k1 gold badge4 silver badges19 bronze badges
11k1 gold badge4 silver badges19 bronze badges
4
$begingroup$
I wonder how relevant, valid and/or correct the point about Bayesian linear regression is.
$endgroup$
– Richard Hardy
Jun 27 at 13:07
1
$begingroup$
@RichardHardy have you done some research to you find out if there's an issue?
$endgroup$
– David
Jun 27 at 13:48
2
$begingroup$
@David, ideally, I would have. However, given the limited resources, I chose not to. There is also the question of who should bring the evidence: one that is claiming a fact without supporting evidence or one that does not quite believe it? I would tilt towards the former. Based on my current state of knowledge, I have some doubts that I have politely expressed in the form of "I wonder", just to give the author a heads up. Perhaps the statement was not thought through and the author would change it. Or if it is correct, then it would be reassuring to at least get a brief confirmation.
$endgroup$
– Richard Hardy
Jun 27 at 14:10
add a comment |
4
$begingroup$
I wonder how relevant, valid and/or correct the point about Bayesian linear regression is.
$endgroup$
– Richard Hardy
Jun 27 at 13:07
1
$begingroup$
@RichardHardy have you done some research to you find out if there's an issue?
$endgroup$
– David
Jun 27 at 13:48
2
$begingroup$
@David, ideally, I would have. However, given the limited resources, I chose not to. There is also the question of who should bring the evidence: one that is claiming a fact without supporting evidence or one that does not quite believe it? I would tilt towards the former. Based on my current state of knowledge, I have some doubts that I have politely expressed in the form of "I wonder", just to give the author a heads up. Perhaps the statement was not thought through and the author would change it. Or if it is correct, then it would be reassuring to at least get a brief confirmation.
$endgroup$
– Richard Hardy
Jun 27 at 14:10
4
4
$begingroup$
I wonder how relevant, valid and/or correct the point about Bayesian linear regression is.
$endgroup$
– Richard Hardy
Jun 27 at 13:07
$begingroup$
I wonder how relevant, valid and/or correct the point about Bayesian linear regression is.
$endgroup$
– Richard Hardy
Jun 27 at 13:07
1
1
$begingroup$
@RichardHardy have you done some research to you find out if there's an issue?
$endgroup$
– David
Jun 27 at 13:48
$begingroup$
@RichardHardy have you done some research to you find out if there's an issue?
$endgroup$
– David
Jun 27 at 13:48
2
2
$begingroup$
@David, ideally, I would have. However, given the limited resources, I chose not to. There is also the question of who should bring the evidence: one that is claiming a fact without supporting evidence or one that does not quite believe it? I would tilt towards the former. Based on my current state of knowledge, I have some doubts that I have politely expressed in the form of "I wonder", just to give the author a heads up. Perhaps the statement was not thought through and the author would change it. Or if it is correct, then it would be reassuring to at least get a brief confirmation.
$endgroup$
– Richard Hardy
Jun 27 at 14:10
$begingroup$
@David, ideally, I would have. However, given the limited resources, I chose not to. There is also the question of who should bring the evidence: one that is claiming a fact without supporting evidence or one that does not quite believe it? I would tilt towards the former. Based on my current state of knowledge, I have some doubts that I have politely expressed in the form of "I wonder", just to give the author a heads up. Perhaps the statement was not thought through and the author would change it. Or if it is correct, then it would be reassuring to at least get a brief confirmation.
$endgroup$
– Richard Hardy
Jun 27 at 14:10
add a comment |
1
$begingroup$
Can you provide some more information - 1. What is your outcome variable? (what are its units?) 2. Which model did you use? 3. Have you performed any model diagnostics?
$endgroup$
– NB12
Jun 27 at 6:28
$begingroup$
This is just plotting area of property to predict the price of that property. The 4 different graphs represents 4 different area plotted against the price of property, which are LivingArea, BasementArea, 1stFloorArea, and GarageArea respectively. I have not build any model around it yet.
$endgroup$
– Aaron
Jun 27 at 6:52
2
$begingroup$
Are you concerned about the changing spread of data about the line or something else? Note that when expected house price is 0 you'd expect the variance to also be 0 - so changing spread with changing expected price is hardly a surprise. I'd probably lean toward some form of gamma model as a first thought.
$endgroup$
– Glen_b♦
Jun 27 at 7:52
$begingroup$
unrelated but if you are trying to predict housing price data for any commercial reason you may want to read money.stackexchange.com/questions/110537/… from yesterday
$endgroup$
– MD-Tech
Jun 27 at 14:17