PORTAL USER GUIDE

Linear Regression Plot (Multiple)

The Linear Regression Plot (Multiple) tool produces a chart displaying simple linear regressions of multiple variables against a single explanatory variable. A linear regression plot gives you more information than a Scatter Plot because it shows the fitted mathematical relationship between the variables. Use the linear regression plot when you have reason to expect a causal link between your variables, rather than just a correlation.

A simple linear regression plot is a scatter plot with a regression line that shows the general direction (trend) in which a group of plotted points is heading. The plot shows the relationship between a scalar dependent variable Y and an explanatory variable X.

This tool is able to plot multiple dependent variables against a single explanatory variable.
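
As a rough illustration of what fitting several dependent variables against one explanatory variable involves, the sketch below computes an ordinary least squares line for each dependent variable. It is not the tool's own implementation: the function name fit_line, the variable names, and all data values are hypothetical and are not taken from any dataset.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of cross-deviations and sum of squared deviations of x.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical values for illustration only (not real data).
explanatory = [60.0, 62.0, 64.0, 66.0, 68.0]
dependents = {
    "overweight": [18.0, 17.5, 17.0, 16.5, 16.0],
    "obese": [9.0, 8.6, 8.2, 7.8, 7.4],
}

# One regression line per dependent variable, all sharing the same X.
fits = {name: fit_line(explanatory, ys) for name, ys in dependents.items()}
for name, (slope, intercept) in fits.items():
    print(f"{name}: y = {intercept:.2f} + ({slope:.2f})x")
```

Each dependent variable gets its own slope and intercept, which is why the resulting chart can carry one regression line per selected dependent variable over a shared X axis.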

Set Up

To illustrate the Linear Regression Plot (Multiple) tool, we will run it on a dataset within Tasmania to visualise the connection between fruit intake and overweight and obesity in children. Prepare the context as follows:

  • Select Tasmania as your area.
  • Select PHIDU – Prevalence of Selected Health Risk Factors – Children and Youth (LGA) 2014-2015 as your dataset with the following attributes:
    • LGA Code 2016
    • LGA Name 2016
    • Estimated Number Of People Aged 4-17 Years With Adequate Fruit Intake (Modelled Estimates) 2014-15 ASR per 100
    • Estimated Number Of Children Aged 2-17 Years Who Were Obese (Modelled Estimates)* 2014-15 ASR per 100
    • Estimated Number Of Children Aged 2-17 Years Who Were Overweight (But Not Obese) (Modelled Estimates) 2014-15 ASR per 100

Once you have added the dataset and the selected attributes, you are ready to use the Linear Regression Plot (Multiple) tool. Read on to learn about the input options.

Inputs

Once you have set up your data, open the Linear Regression Plot (Multiple) tool (Tools → Charts → Linear Regression Plot (Multiple)). The input fields are as follows:

  • Dataset Input: The dataset containing the variables that you would like to run through the tool. Select PHIDU – Prevalence of Selected Health Risk Factors – Children and Youth (LGA) 2014-2015.
  • Dependent Variable: This is where we select the variable that we want to model or regress on the independent variable. Select the following variables:
    • Estimated Number Of Children Aged 2-17 Years Who Were Overweight (But Not Obese) (Modelled Estimates) 2014-15 ASR per 100
    • Estimated Number Of Children Aged 2-17 Years Who Were Obese (Modelled Estimates)* 2014-15 ASR per 100
  • Independent Variable: This is where we select the variable that we want to test as the predictor for the dependent variables. Select Estimated Number Of People Aged 4-17 Years With Adequate Fruit Intake (Modelled Estimates) 2014-15 ASR per 100.
  • Use Variable Titles: Check this box to have “human-readable names” on your output chart. Tick this box.
  • Chart Title: This is where we enter the name that we want to give the resultant plot. Type Relationship between fruit intake and obesity in children.
  • Show Gridlines: Check this box if you want gridlines for your graph. Tick this box.
  • Legend: Check this box if you want to include a legend for your graph. Tick this box.
  • Greyscale: Check this box if you want your graph to be produced in greyscale. Untick this box.

A summary of the inputs for our case can be viewed in the image below. Once complete, click Run Tool.

Outputs

Once you have run the tool, click the Display Output button that appears in the pop-up dialogue box. This should open a chart like the one shown below.

The output provides the estimated linear regression equation and indicates a negative relationship between adequate fruit intake in children and both childhood overweight and obesity.
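
To make the link between a regression equation and the direction of a relationship concrete, the following minimal sketch formats a fitted line as an equation and reads the direction from the sign of the slope. The helper describe_fit and the coefficient values are hypothetical and are not the values produced by the tool.

```python
def describe_fit(slope, intercept, tol=1e-12):
    """Return the equation text and the direction implied by the slope."""
    sign = "+" if slope >= 0 else "-"
    equation = f"y = {intercept:.2f} {sign} {abs(slope):.2f}x"
    if slope > tol:
        direction = "positive"
    elif slope < -tol:
        direction = "negative"
    else:
        direction = "flat"
    return equation, direction


# Hypothetical coefficients for illustration only.
equation, direction = describe_fit(slope=-0.25, intercept=33.0)
print(equation, "->", direction)  # prints: y = 33.00 - 0.25x -> negative
```

A negative slope means the fitted value of Y falls as X rises, which is how a downward-sloping regression line on the chart should be read.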
