Don Wise Oct 13, 2018 11:59 AM (in response to visakha chan)
Hello Visakha,
You'll need to upload a .twbx file in order to get some help. You've uploaded a .twb file. A .twbx file is a packaged workbook that contains the data; the .twb you uploaded is only the XML workbook definition, without the data.

ShivaRam Chennapragada Oct 13, 2018 12:05 PM (in response to visakha chan)
A scatter plot with a trend line is one option for visualizing Sales vs. Quantity. I've attached an example.

Scatter Plot_v10.2.twbx 2.4 MB


visakha chan Oct 13, 2018 1:34 PM (in response to visakha chan)
It said my .twbx file is too large, so I cannot upload it.

visakha chan Oct 13, 2018 1:34 PM (in response to ShivaRam Chennapragada)
Looking at the graph, what would you say about the relationship between sales and quantity? I am really new to data analysis. Thanks.

patrick.byrne.0 Oct 13, 2018 1:39 PM (in response to visakha chan)
Hello Visakha,
Try adding a data source filter and then extracting a smaller subset of the data to be shared.
Cheers,
Byrne, Patrick

ShivaRam Chennapragada Oct 13, 2018 3:31 PM (in response to visakha chan)
Visakha, without getting into too much detail, I'll keep this simple. The example I've used looks at the relationship between Sales Qty and Sales Amount for all the Products across all 3 Product Categories.
When performing a linear regression like this, it is important to note 3 things:
1) Slope (y = mx + b)
y is the dependent variable, x is the explanatory variable, m is the slope of the line, and b is the y-intercept.
Introduction to Linear Regression
2) R-Squared value (tells me how well the model fits the data; higher is generally better, but not always)
Regression Analysis: How Do I Interpret R-squared and Assess the Goodness-of-Fit?
3) p-value
What a p-Value Tells You about Statistical Data - dummies
In my model, the R-Squared values for all three categories are fairly low, which means there isn't much correlation between the two. What does stand out is that even where quantity is low, the sales amount can be much higher because of a high price.
Refer to this article for more on this topic,
Add Trend Lines to a Visualization
Happy learning.
Best,
Shiva.
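
To make the three quantities above concrete, here is a minimal sketch that fits y = mx + b by ordinary least squares and computes R-Squared by hand. The function name fit_line and the quantity/sales numbers are made up for illustration; this is not how Tableau computes its trend line internally, just the textbook formulas.

```python
from math import fsum

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = m*x + b; returns (m, b, r_squared)."""
    n = len(xs)
    mean_x = fsum(xs) / n
    mean_y = fsum(ys) / n
    # Spread of x around its mean, and how x and y vary together
    sxx = fsum((x - mean_x) ** 2 for x in xs)
    sxy = fsum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # SSE: variation the line leaves unexplained; SST: total variation in y
    sse = fsum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    sst = fsum((y - mean_y) ** 2 for y in ys)
    r_squared = 1 - sse / sst
    return slope, intercept, r_squared

# Hypothetical data, for illustration only
quantity = [1, 2, 3, 4, 5]
sales = [12, 19, 33, 38, 52]

m, b, r2 = fit_line(quantity, sales)
print(f"slope={m:.2f} intercept={b:.2f} R-squared={r2:.3f}")
```

An R-Squared close to 1 means the line accounts for most of the variation in y; a value close to 0 means it accounts for very little.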

visakha chan Oct 13, 2018 4:17 PM (in response to ShivaRam Chennapragada)
I really appreciate your explanation. Since this is my first time using this software, I am having a hard time understanding the p-value.
Trend Lines Model
A linear trend model is computed for sum of Sales Dollars given sum of Sales Quantity. The model may be significant at p <= 0.05.
Model formula: ( Sales Quantity + intercept )
Number of modeled observations: 11515
Number of filtered observations: 0
Model degrees of freedom: 2
Residual degrees of freedom (DF): 11513
SSE (sum squared error): 1.90963e+14
MSE (mean squared error): 1.65867e+10
R-Squared: 0.720018
Standard error: 128789
p-value (significance): < 0.0001
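
The figures in this summary are related to each other, which can make them easier to read. The sketch below recomputes a few of them from the others, using only the numbers reported above. The variable names are mine, and the F statistic is the standard one for a single-predictor linear model; Tableau does not print it in this summary, so treat that part as an assumption about what sits behind the reported p-value.

```python
from math import sqrt

# Values copied from the trend line summary above
n_obs = 11515
model_df = 2           # slope + intercept
sse = 1.90963e14
r_squared = 0.720018

resid_df = n_obs - model_df          # residual degrees of freedom
mse = sse / resid_df                 # mean squared error
std_error = sqrt(mse)                # standard error = sqrt(MSE)

# Strength of the linear correlation (the sign comes from the slope,
# which this summary does not show)
r_magnitude = sqrt(r_squared)

# F statistic for one predictor: explained vs. unexplained variation
f_stat = (r_squared / (1 - r_squared)) * (resid_df / (model_df - 1))

print(resid_df, f"{mse:.5e}", round(std_error), round(r_magnitude, 3), round(f_stat))
```

The recomputed MSE and standard error agree with the reported ones, and the very large F statistic is consistent with a p-value below 0.0001.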

visakha chan Oct 13, 2018 5:33 PM (in response to patrick.byrne.0)
I don't know how to do that. I have never used this software before.

ShivaRam Chennapragada Oct 13, 2018 8:18 PM (in response to visakha chan)
Visakha, I will try to put here what I remember from my Stats class. The p-value is a probability value in statistics. As an analyst, before diving into the data you'd have a hypothesis, and the goal of your analysis would be to support or reject it, backing up your conclusion with results either way. That's when measures like the p-value or R-Squared come in handy.
In this example, you'd think that there is a relationship between Sales Quantity and Sales Amount. In statistical terms that claim is the "Alternative Hypothesis". The "Null Hypothesis" states the opposite: there is no relationship, and the slope of the trend line is zero. Your aim when constructing the graph, adding a trend line, and analyzing the results is to decide whether the data let you reject the "Null Hypothesis". You do that using the p-value, which is always between 0 and 1. It is the probability of seeing a trend at least this strong if the "Null Hypothesis" were true.
1) If the p-value <= 0.05, a trend this strong would be unlikely under the "Null Hypothesis", so you reject it and conclude that Qty and Sales Amount are related.
2) If the p-value > 0.05, you cannot reject the "Null Hypothesis": the data do not show a significant relationship.
If it is close to 0.05, you'll have to be cautious.
Coming back to the example, Tableau returned a p-value (< 0.0001), which is far less than 0.05. This means there is very strong evidence that Qty and Amounts are related.
Similarly, the R-Squared value tells you how much of the variation in Sales Amount the trend line explains. In your output it is about 0.72, so the line explains roughly 72% of the variation, which is a fairly good fit.
Hope this explanation helps.
Best,
Shiva.
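
One way to get a feel for what a p-value measures is a permutation test, sketched below. It is a different technique from the one Tableau uses for trend lines, swapped in here because it needs nothing beyond the Python standard library. The null hypothesis is "no relationship", so the sketch shuffles the y values many times and counts how often a shuffled data set shows a correlation at least as strong as the real one. The function names and the data are made up for illustration.

```python
import random
from math import fsum, sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = fsum(xs) / n, fsum(ys) / n
    sxy = fsum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = fsum((x - mean_x) ** 2 for x in xs)
    syy = fsum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def permutation_p_value(xs, ys, trials=2000, seed=0):
    """Share of shuffles whose |r| is at least the observed |r|."""
    rng = random.Random(seed)      # fixed seed so the run is repeatable
    observed = abs(pearson_r(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(trials):
        rng.shuffle(shuffled)      # break any real link between x and y
        if abs(pearson_r(xs, shuffled)) >= observed:
            hits += 1
    # +1 keeps the estimate strictly above zero
    return (hits + 1) / (trials + 1)

# Hypothetical data, for illustration only
quantity = [1, 2, 3, 4, 5, 6, 7, 8]
sales = [12, 19, 33, 38, 52, 61, 66, 83]

p = permutation_p_value(quantity, sales)
print(f"permutation p-value: {p:.4f}")
```

A small result means that shuffled data almost never looks as strongly related as the real data, which is evidence against "no relationship".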

ShivaRam Chennapragada Oct 13, 2018 8:19 PM (in response to visakha chan)
If you have further questions, please attach a packaged workbook as Patrick suggested. To apply a data source filter, go to the Data Source page:
Click on Filters at the top right.
Click on the Add button, and in the popup limit your data using dimension filters (e.g. date).
Filter Data from Your Data Source
Best,
Shiva.