Math Help Forum

Math Help Forum Feed Site Feed

Go Back   Math Help Forum > High School Math Help > Probability and Statistics
Reply
 
Thread Tools Display Modes
  #1  
Old 08-25-2008, 09:58 PM
Junior Member
 
Join Date: Aug 2008
Posts: 5
Country:
Thanks: 0
Thanked 0 Times in 0 Posts
grain is on a distinguished road
Default Pair t test or....?

Hello,

What can use instead of the ‘pair t test’?

I have two variables both of which are recording temperature of the same house at 1 hr intervals over the same time period. One of the variables is the real-world measurements and the other is temperature modelled by myself in a program.

I need an indication of how separate two sets of measurements are. They are measured over 2 months so I have 1300 measurements i.e. n = 1300.

These temperatures oscillate over the day like a sine wave and so aren’t normally distributed.

Can anyone please help?
Reply With Quote
Advertisement
 
  #2  
Old 08-27-2008, 10:04 AM
MHF Contributor
 
Join Date: Aug 2007
Location: USA
Posts: 1,264
Country:
Thanks: 8
Thanked 279 Times in 258 Posts
TKHunny is a glorious beacon of lightTKHunny is a glorious beacon of lightTKHunny is a glorious beacon of lightTKHunny is a glorious beacon of lightTKHunny is a glorious beacon of light
Default

Even if you post on 12 more websites, you should continue to pursue the "Goodness of Fit" angle. When was your last Statistics Class?
Reply With Quote
  #3  
Old 08-27-2008, 04:08 PM
Senior Member
 
Join Date: Mar 2008
Posts: 125
Country:
Thanks: 4
Thanked 65 Times in 54 Posts
awkward will become famous soon enoughawkward will become famous soon enough
Default

Quote:
Originally Posted by grain View Post
Hello,

What can use instead of the ‘pair t test’?

I have two variables both of which are recording temperature of the same house at 1 hr intervals over the same time period. One of the variables is the real-world measurements and the other is temperature modelled by myself in a program.

I need an indication of how separate two sets of measurements are. They are measured over 2 months so I have 1300 measurements i.e. n = 1300.

These temperatures oscillate over the day like a sine wave and so aren’t normally distributed.

Can anyone please help?
Let's say the measured temperatures are T_i,\, i=1,2,\dots,1300 and the modeled temperatures are M_i,\, i=1,2,\dots,1300. My suggestion would be to compute the values T_i - M_i and apply a Z-test for zero mean. That way your assumption is that the T_i - M_i values are normally distributed, which seems reasonable, and the fact that the temperatures have a pattern may be irrelevant.
Reply With Quote
  #4  
Old 08-28-2008, 05:21 AM
Junior Member
 
Join Date: Aug 2008
Posts: 5
Country:
Thanks: 0
Thanked 0 Times in 0 Posts
grain is on a distinguished road
Default

Quote:
Originally Posted by TKHunny View Post
Even if you post on 12 more websites, you should continue to pursue the "Goodness of Fit" angle. When was your last Statistics Class?
Yes, my last Statistics class was a couple of years ago, hence the somewhat wondering in dark.

From what I've read I need to use the chi square statistic, which is a sum of differences between observed and expected outcome frequencies, each squared and divided by the expectation: The degrees of freedom df = (rows-1)(columns-1)

My problem is that, my degrees of freedom is = (1200-1)(2-1) = 1200

All the tables I've seen only go up to 9 or so df's.

Does this mean I should just take a random sample of 10 measurements and apply it to that rather than the entire data set?

Thanks for your patience.
Reply With Quote
  #5  
Old 08-28-2008, 06:26 AM
mr fantastic's Avatar
Flow Master
 
Join Date: Dec 2007
Location: Zeitgeist
Posts: 6,062
Country:
Thanks: 955
Thanked 2,342 Times in 2,116 Posts
mr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond repute
Default

Quote:
Originally Posted by grain View Post
Yes, my last Statistics class was a couple of years ago, hence the somewhat wondering in dark.

From what I've read I need to use the chi square statistic, which is a sum of differences between observed and expected outcome frequencies, each squared and divided by the expectation: The degrees of freedom df = (rows-1)(columns-1)

My problem is that, my degrees of freedom is = (1200-1)(2-1) = 1200

All the tables I've seen only go up to 9 or so df's. Mr F says: Then you're looking in the wrong places. Tables of critical values routinely go up to 100 degrees of freedom. See PlanetMath: table of critical values of chi-squared distributions for example.

Does this mean I should just take a random sample of 10 measurements and apply it to that rather than the entire data set?

Thanks for your patience.
Throwing away perfectly good data is totally absurd.

The chi-squared distribution is asymptotically normal as the number of degrees of freedom becomes infinite. It has mean = n and variance = 2n; for a large degree of freedom (and 1200 is large) you can get a very good approximation using the normal distribution with this mean and variance.

That's why tables are usually not made for more than 100 degrees of freedom.
__________________
There are two things you should never try to prove ...... the impossible and the obvious.

Lack of planning on your part does not constitute an emergency on my part.

Pressure makes diamonds.
Reply With Quote
  #6  
Old 08-29-2008, 05:21 AM
Junior Member
 
Join Date: Aug 2008
Posts: 5
Country:
Thanks: 0
Thanked 0 Times in 0 Posts
grain is on a distinguished road
Default

Quote:
Originally Posted by mr fantastic View Post
Throwing away perfectly good data is totally absurd.

The chi-squared distribution is asymptotically normal as the number of degrees of freedom becomes infinite. It has mean = n and variance = 2n; for a large degree of freedom (and 1200 is large) you can get a very good approximation using the normal distribution with this mean and variance.

That's why tables are usually not made for more than 100 degrees of freedom.
Thanks for that


Ok, I think I’ve got it now. Here it goes:

My null hypothesis is that there is no difference between the observed and expected temperatures.

My calculated chi statistic = 2.76

I have 1276 rows which effectively gives a degree of freedom of 100 (tables limit).

The critical value for the chi-square at significance of 0.05 is =124.3; if the calculated chi-square value is equal to or greater than this critical value, I can conclude that the probability of the null hypothesis being correct is 0.05. i.e. a very low probability. But it isn’t! it is in fact much less than 124.3. This means that because my degrees of freedom are so high and my chi statistic so low, I can accept the null hypothesis with great confidence.

Is my method of calculation correct or should I be using a different statistic because my degrees of freedom are so large?
Reply With Quote
  #7  
Old 08-29-2008, 05:45 AM
mr fantastic's Avatar
Flow Master
 
Join Date: Dec 2007
Location: Zeitgeist
Posts: 6,062
Country:
Thanks: 955
Thanked 2,342 Times in 2,116 Posts
mr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond reputemr fantastic has a reputation beyond repute
Default

Quote:
Originally Posted by grain View Post
Thanks for that


Ok, I think I’ve got it now. Here it goes:

My null hypothesis is that there is no difference between the observed and expected temperatures.

My calculated chi statistic = 2.76

I have 1276 rows which effectively gives a degree of freedom of 100 (tables limit).

The critical value for the chi-square at significance of 0.05 is =124.3; if the calculated chi-square value is equal to or greater than this critical value, I can conclude that the probability of the null hypothesis being correct is 0.05. i.e. a very low probability. But it isn’t! it is in fact much less than 124.3. This means that because my degrees of freedom are so high and my chi statistic so low, I can accept the null hypothesis with great confidence.

Is my method of calculation correct or should I be using a different statistic because my degrees of freedom are so large?
If you're testing goodness of fit between your model and the data, your calculation looks OK to me.
__________________
There are two things you should never try to prove ...... the impossible and the obvious.

Lack of planning on your part does not constitute an emergency on my part.

Pressure makes diamonds.
Reply With Quote
  #8  
Old 08-29-2008, 07:25 PM
Junior Member
 
Join Date: Aug 2008
Posts: 5
Country:
Thanks: 0
Thanked 0 Times in 0 Posts
grain is on a distinguished road
Default

Quote:
Originally Posted by mr fantastic View Post
If you're testing goodness of fit between your model and the data, your calculation looks OK to me.
Great, Thanks for that.

Goodness of fit is what I'm trying to achieve so that's a relief!
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off
Forum Jump


All times are GMT -7. The time now is 03:15 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
SEO by vBSEO 3.2.0 ©2008, Crawlability, Inc.
©2005 - 2008 Math Help Forum


Math Help Forum is a community of maths forums with an emphasis on maths help in all levels of mathematics.
Register to post your math questions or just hang out and try some of our math games or visit the arcade.