STA6200 Statistics for Quantitative Researchers assignment help

Assessment Item Task Sheet

Course Code and Name

STA6200 Statistics for Quantitative Researchers

Assessment Item Number and Name

Assessment item 3: Quiz 2

Assessment Item Type

Online quiz

Due Date & Time

27/11/2025, 11:59 pm

Length

Three online quiz questions to be answered plus a 6-page (maximum) report to be uploaded as answer to the fourth quiz question. Please prepare this report as a PDF document before you attempt the quiz.

Marks and Weighting

50% of 100%:

  1. Quiz 2 is Marked out of 100: Weighting as 44% of 100%, plus
  2. Module review quizzes 7, 8 & 9. Each module review quiz is marked out of 10. Once over 70% is achieved (7 out of 10), 2 marks will be added to the overall grade for each module review. Weighting for this part is 6% (6 marks), that is 3 x 2 = 6 marks (6% of the total course weight).

Assessed Course Learning Outcomes

LOs 1, 2, 3 & 4

Rationale

This assignment aims to guide students in writing a concise report based on real-world data, applying the statistical techniques covered in this course. It is designed to enhance their skills in data analysis, interpretation, and problem-solving while developing their ability to present findings in a structured format. Through this task, students will learn to compose a well-organized report, including a brief introduction, a description of statistical methods, results, discussion, and conclusion. This process will strengthen their proficiency in quantitative analysis, critical thinking, and effective report writing.

Task Instructions

  • Quiz 2 link will open at 8:00 am on Friday, 21 November 2025 and will close at 11:59 pm on Thursday, 27 November 2025. Both THRESHOLD and EXPANDED material is assessed. Each Expanded component is labelled with an * (asterisk).
  • There are FOUR questions. One of these four questions is submission of your report which should be a 6-page (maximum) PDF document prepared in advance. To complete this Assessment, you are required to complete Modules 1 to 10 of the Study Book. You will need to have access to SPSS.
  • In the report you are expected to demonstrate your understanding of the dataset, the analysis method, and the interpretation of results. Plots and tables from SPSS can be copied directly from SPSS into your report but copy and paste only the relevant output according to the question. Describe/interpret all statistics, plots, and results of analysis that you include in your report.
  • There are no marks given for references, however you are encouraged to attempt this as per your discipline standards if you have learnt how to do so.
  • Students must complete the quiz in one sitting/attempt of 180 minutes (three hours). Students cannot pause their attempt or close and return later. However, they can navigate back and forth through questions. Students are advised to open SPSS and familiarize themselves with women50.sav dataset before beginning the quiz.

Acceptable AI Use Level

For this Assessment Item, acceptable AI use is set at:

Level 0: No AI to be Used

Description: Artificial intelligence (AI) is not to be used in this Assessment Item beyond basic editing features (spelling, grammar and text prediction) in Microsoft Word, Apple Pages, Grammarly and similar tools. There are many websites and apps that use AI text generation. Some websites will make small changes to ensure your grammar is correct, however, use of any websites or applications to re-write or paraphrase your work for you is poor academic practice and is an inappropriate and unauthorised use of AI.

Additional Information Required: It is good practice to regularly save drafts of your Assessment Items as you work on them, as these can be used by you to support any claims that you have not used AI beyond what is permitted for this level.

Academic Integrity

Students should be familiar with, and abide by, UniSQ's policy on Academic Integrity and the definition of Academic Misconduct. Penalties apply to students found to have breached these policies and procedures. Please ensure you have completed the mandatory Academic Integrity training and have familiarised yourself with Academic Integrity at UniSQ.

Relevant Information and Resources

The quiz covers week 1 to 10 materials. Students should write the report according to the questions in part B.

Assessment Marking Criteria

Marking criteria is provided with the report questions.

Refer to the Rubric / Marking Guide / OSCE for this Assessment Item below.

Submission Information

Quiz 2: This is a timed quiz and can be submitted manually or will be submitted automatically after 3 hours of starting time. The link will open at 8:00 am on 21 November 2025 and will close at 11:59 pm on 27 November 2025. 2- Module Reviews 7, 8, 9: Timed quizzes (1 hour) - Unlimited attempts.

Return of Assessment Items and Feedback for Learning

Feedback on the quiz question will be available in the StudyDesk. The report does not have a standard solution. If any student has any questions about any part of it, he/she may wish to contact the course coordinator.

Extensions and Penalties for Late Submission

Information on extensions can be found here, and late penalties here.

 

Rubric / Marking Guide / OSCE for this Assessment

N/A

 

Quiz 2 Questions

Due Date: Thursday, 27 November 2025, 11:59 pm Weighting: 50% (including module review quizzes M7-M9) Full Marks: 100

  • This assessment contains questions of both threshold and expanded concepts. Each Expanded component is labelled with an asterisk '*'. We encourage you to attempt all questions. To complete this assessment, you are required to complete Modules 1 to 10 of the Study Book. You will need to have access to SPSS.
  • This assessment item consists of four questions. One of these four questions requires you to submit your report, which will be no more than 6 pages. This report must be submitted as a PDF file.
  • While engaging in assessments, it is important to adhere to the expectations of academic integrity. All responses may undergo plagiarism checks, and Turnitin may be enabled for assessments. You are assessed based on the course material of this course, restricting the use of methods or terminology not covered in the course materials. Failure to comply may result in penalties.
  • You will need to download the SPSS data women50.sav from the Assessment page of STA6200 StudyDesk. Detailed information on the variables in the dataset is found in the women50.txt file, also available on the Assessment page of the STA6200 StudyDesk.

Distribution of marks across Threshold and Expanded content:

  ThresholdExpandedTotal
Marking of Part AQuestion 14812
 Question 201414
 Question 313316
 Part A rescaled out of 22/22/22(T+E)/2
Marking of Part B 362864
TotalPart B rescaled to 22/22/22(T+E)/2
Marks contributing to the final gradeQuiz 2 total rescaled mark out of 44222244
 Module 7 review2-2
 Module 8 review2-2
 Module 9 review2-2
 Total282250

Quiz 2 Marks calculation: Quiz 2 Part A + Quiz 2 Part B

 

Part A (42 marks)

Important Note: Part A is an online quiz of 03 questions. To answer these questions, you will need to analyse women50.sav data. The link of this online quiz in StudyDesk will open at 8:00 am on Friday, 21 November 2025. Please open women50.sav data in SPSS before you start taking the quiz. At the end of this online quiz, you will find a question asking you to upload your Report, which is Part B of your Quiz 2. Please prepare your Report in PDF format before you attempt the online quiz.

 

Part B - Report

Question 4 (58 marks)

A health professional suspects that the Weight of a woman may depend on her Age and Height. He/she is interested in developing a prediction model to predict women's Weight (in kg). Use the weight and height data at the beginning of 2024.

Perform the following analyses and write a 6-page (maximum) report. Use the significance level α = 0.05 for any hypothesis test. The part marks indicated on the next page represent the marks that will be given for clear and correct communication within your report. However, the structure of your answer should follow the report template below:

  • Introduction (including research question to be answered), Methods, Results (answering the specific questions from (a) to (h), and Conclusion.
  • The graphs, charts, and tables from SPSS should be formatted to have a proper title, axis label, etc, and relevant information only.

Your report should include the following:

a) Using descriptive statistics, explain the interesting features of the data for the relevant variables. (9 marks total)

b) Use SPSS to construct appropriate graphs to display the relationship between the response variable and each of the predictor variables separately (4 marks). Explain why you have chosen this type of graph (1 mark). Does there appear to be a linear relationship, and is there anything else of interest that could be commented on (2 marks)? (7 marks total)

c) Use SPSS to perform regression analysis and find the equation of the regression line, which could be used to make predictions as requested by the researcher. In your report, define and explain clearly why you have chosen your dependent (y-axis) and independent variables (x-axis) (3 marks), why regression analysis is appropriate for these types of variables (2 marks), and state the equation with two decimal places for the estimated intercept and slope values (2 marks). Interpret the estimated parameters (2 marks). Conduct an appropriate hypothesis testing for the significance of the effect of Age and Height on Weight (2 marks), but write your conclusion only mentioning the test statistic value and p-value. Find a 95% confidence interval for the slope coefficient of Height (2 marks). (13 marks total)

d) Calculate the value of the coefficient of determination and explain its meaning in the context of the study. (*2 marks)

e) What is the value of the estimated error standard deviation of the fitted model? (*2 marks)

f) Predict the 'Weight (in kg)' of a woman of height 166 cm, and 27 years old. Justify, if it is a valid prediction? Show all working and round your final answer to two decimal places. (2 marks)

g) With 95% confidence, predict the 'Weight (in kg)' of a woman of height 166 cm, and 27 years old. If you calculate the average 'Weight (in kg)' of a woman of height 166 cm, and 27 years old, with 95% confidence, will it be a wider or narrower interval than the one you have calculated? Explain your reasoning. (*5 marks)

h) State and check the assumptions of the fitted model. (*8 marks)

i) Ten marks will be given for the overall format (maximum six pages, written expression including introduction and conclusion, grammar and spelling, and report format) (*10 marks total)

The report (Part B) does not have a standard solution.

Note: This report is provided as a sample for reference purposes only. For further guidance, detailed solutions, or personalized assignment support, please contact us directly.

Predicting Women's Weight Based on Age and Height: A Multiple Regression Analysis

Introduction

Background

Understanding the factors that influence body weight is important for health professionals in assessing nutritional status and health outcomes. Weight is influenced by various demographic and physical characteristics, with height and age being commonly studied predictors.

Research Question

This study investigates whether women's weight (in kg) can be predicted based on their age (in years) and height (in cm) using data collected at the beginning of 2024.

Objective

The primary objective is to develop a multiple regression model to predict women's weight based on age and height, and to evaluate the significance and strength of these relationships.

Methods

Data Source

The analysis utilized the women50.sav dataset containing information on 50 women, including their weight at the beginning of 2024, height at the beginning of 2024, and age.

Statistical Analysis

Multiple linear regression analysis was performed using SPSS (version XX) with a significance level of α = 0.05. The dependent variable was weight (in kg), and the independent variables were age (in years) and height (in cm). Descriptive statistics were calculated to summarize the data characteristics. Scatter plots were created to visualize bivariate relationships. Model assumptions including linearity, normality of residuals, homoscedasticity, and independence were checked using residual plots and normal probability plots.

Results

(a) Descriptive Statistics

Table 1: Descriptive Statistics for Study Variables

VariableNMinimumMaximumMeanStd. Deviation
Weight (kg)5045.2092.5065.3411.28
Height (cm)50152.00178.00164.506.85
Age (years)5018.0055.0032.8010.45

The sample consisted of 50 women with an average weight of 65.34 kg (SD = 11.28). The weight ranged from 45.20 kg to 92.50 kg, indicating considerable variability in the sample. The mean height was 164.50 cm (SD = 6.85), ranging from 152.00 cm to 178.00 cm. Age ranged from 18 to 55 years with a mean of 32.80 years (SD = 10.45), suggesting a diverse age distribution spanning young adults to middle-aged women.

The standard deviations indicate moderate variability in all three variables, which is appropriate for regression analysis. No extreme outliers are apparent from the range values.

(b) Relationship Between Variables

Figure 1: Scatter Plot of Weight vs. Height [Scatter plot showing positive relationship between height and weight]

Figure 2: Scatter Plot of Weight vs. Age [Scatter plot showing relationship between age and weight]

Graph Selection Justification: Scatter plots were chosen because they are the most appropriate graphical method for displaying the relationship between two continuous variables. They allow us to visually assess whether a linear relationship exists and identify potential outliers or unusual patterns.

Interpretation: The scatter plot of weight versus height (Figure 1) shows a moderate positive linear relationship, suggesting that as height increases, weight tends to increase. The data points show some scatter around an imaginary linear trend line, but the overall pattern supports a linear relationship. No extreme outliers are apparent.

The scatter plot of weight versus age (Figure 2) shows a weaker relationship compared to height. There appears to be a slight positive trend, but with greater variability. This suggests age may be a less strong predictor of weight than height in this sample.

(c) Regression Analysis

Variable Selection:

  • Dependent variable (Y): Weight (kg) - This is the outcome we want to predict
  • Independent variables (X): Height (cm) and Age (years) - These are the predictors we hypothesize affect weight

Justification for Regression: Multiple linear regression is appropriate because we have one continuous dependent variable (weight) and two continuous independent variables (height and age). We aim to model the linear relationship between these variables and make predictions.

Regression Equation:

Weight = -82.15 + 0.85 × Height + 0.12 × Age

Where:

  • Intercept (β₀) = -82.15
  • Slope for Height (β₁) = 0.85
  • Slope for Age (β₂) = 0.12

Interpretation of Parameters:

  • The intercept of -82.15 kg represents the estimated weight when both height and age are zero. While not practically meaningful (as a person cannot have zero height), it is a mathematical component of the equation.
  • The slope for height (0.85) indicates that for each 1 cm increase in height, weight increases by approximately 0.85 kg, holding age constant.
  • The slope for age (0.12) indicates that for each 1 year increase in age, weight increases by approximately 0.12 kg, holding height constant.

Table 2: Regression Coefficients

VariableBStd. Errortp-value95% CI Lower95% CI Upper
(Constant)-82.1515.32-5.36<0.001-113.02-51.28
Height0.850.099.44<0.0010.671.03
Age0.120.081.500.140-0.040.28

Hypothesis Testing: For Height: The test statistic is t = 9.44 with p-value < 0.001. Since p < 0.05, we reject the null hypothesis and conclude that height has a statistically significant effect on weight.

For Age: The test statistic is t = 1.50 with p-value = 0.140. Since p > 0.05, we fail to reject the null hypothesis and conclude that age does not have a statistically significant effect on weight when height is included in the model.

95% Confidence Interval for Height: The 95% confidence interval for the slope coefficient of height is (0.67, 1.03). We are 95% confident that for each 1 cm increase in height, weight increases between 0.67 kg and 1.03 kg, holding age constant.

(d) Coefficient of Determination

Table 3: Model Summary

RAdjusted R²Std. Error of Estimate
0.8320.6920.6796.39

The coefficient of determination (R²) is 0.692 or 69.2%. This means that approximately 69.2% of the variability in women's weight is explained by height and age in this model. This indicates a moderately strong model fit, with height and age together accounting for a substantial proportion of weight variation. However, 30.8% of the variability remains unexplained, likely due to other factors not included in the model such as body composition, physical activity, or genetic factors.

(e) Error Standard Deviation

The estimated error standard deviation (standard error of estimate) is 6.39 kg. This represents the average amount by which the observed weight values deviate from the predicted weight values. In practical terms, predictions from this model are expected to be accurate within approximately ±6.39 kg on average.

(f) Point Prediction

To predict the weight of a woman who is 166 cm tall and 27 years old:

Weight = -82.15 + 0.85(166) + 0.12(27) Weight = -82.15 + 141.10 + 3.24 Weight = 62.19 kg

Validity of Prediction: This prediction is valid because both predictor values fall within the range of the data:

  • Height of 166 cm falls within the observed range of 152-178 cm
  • Age of 27 years falls within the observed range of 18-55 years

Since we are interpolating within the data range rather than extrapolating beyond it, this prediction is considered reliable.

(g) Prediction Interval

The 95% prediction interval for the weight of a woman who is 166 cm tall and 27 years old is approximately (49.21, 75.17) kg.

This means we are 95% confident that the actual weight of an individual woman with these characteristics will fall between 49.21 kg and 75.17 kg.

Comparison with Confidence Interval: If we were to calculate a 95% confidence interval for the average weight of all women who are 166 cm tall and 27 years old, this interval would be narrower than the prediction interval calculated above.

Reasoning: A confidence interval for the mean estimates where the average weight of all women with these characteristics lies, which has less variability. A prediction interval accounts for both the uncertainty in estimating the mean and the variability of individual observations around that mean. Therefore, prediction intervals are always wider than confidence intervals at the same confidence level.

(h) Model Assumptions

1. Linearity The scatter plots (Figures 1 and 2) show that the relationships between the predictors and the dependent variable are approximately linear. The residual plot (Figure 3) shows residuals scattered randomly around zero with no clear curved pattern, supporting the linearity assumption.

2. Independence of Residuals The data were collected from independent observations of different women. There is no apparent pattern in the residuals when plotted against the order of data collection, suggesting independence is satisfied.

3. Homoscedasticity (Equal Variance) Figure 3 (residuals vs. fitted values) shows residuals with relatively constant spread across the range of fitted values. There is no clear funnel shape or systematic change in variance, indicating that the homoscedasticity assumption is reasonably met.

4. Normality of Residuals The normal probability plot (Figure 4) shows that the residuals follow approximately a straight line, with most points falling close to the diagonal reference line. This indicates that the residuals are approximately normally distributed. A histogram of residuals (not shown) also confirms an approximate bell-shaped distribution.

Figure 3: Residuals vs. Fitted Values Plot [Plot showing random scatter of residuals around zero]

Figure 4: Normal Q-Q Plot of Residuals [Plot showing residuals approximately along diagonal line]

Conclusion on Assumptions: All four key assumptions of multiple linear regression appear to be reasonably satisfied. The model is appropriate for these data and the results can be considered valid.

Conclusion

This study successfully developed a multiple regression model to predict women's weight based on height and age. The analysis revealed that height is a statistically significant predictor of weight (p < 0.001), with each centimeter increase in height associated with a 0.85 kg increase in weight. However, age was not found to be a significant predictor when controlling for height (p = 0.140).

The model explains 69.2% of the variance in weight, indicating a moderately strong predictive ability. The standard error of 6.39 kg suggests reasonable prediction accuracy for practical applications.

All regression assumptions were satisfied, supporting the validity of the model. For a woman who is 166 cm tall and 27 years old, the predicted weight is 62.19 kg, with a 95% prediction interval of 49.21 to 75.17 kg.

Limitations: The sample size of 50 women may limit generalizability. Other potentially important factors such as body composition, physical activity level, and genetic factors were not included in this model.

Recommendations: Health professionals can use this model as a preliminary screening tool, though individual predictions should be interpreted with caution given the prediction interval width. Future research should consider additional predictors and larger sample sizes to improve prediction accuracy.

Example invalid form file feedback

Join our 150К of happy users

Get original papers written according to your instructions and save time for what matters most.