realizată de Morgan Denman 1 an în urmă
89
Mai multe ca aceasta
după Siti Zakirah
după Leyla Izaguirre
după Mindomo Team
după heba amr
Bro. Cloward talked about the little Croation boy that needed to take the test. He could not take the test because he was not represented in the norming group and therefore his results would have been invalid.
Stable over time. Stability must be reliable over time, measured using test-rest, and informal end isn't stable.
Internal
Consistency inside of itself, reliable across time such as split-half, alternate forms, and coefficent alpha
Inter-Rater
Reliable across time, results are the same no matter who the examiner is, and it is measured by inter-rater comparisons
The Salvia textbook defines Reliability as "To the extent that we can generalize from a particular set of observations that those observations are truthful."
I define reliability as able to be trusted, or trustworthy information
Use multiple measures an example of this is RIOT
Multiple times and places
Measures each test question to the standards. Think Praxis exams.
Comparing the test to a theory
The test and the theory behind it. Think IQ test or personality tests
Content
Comparing the test to the content taught
Not all tests have to do this. Measures curriculum. Think midterms
What is the content that is being tested?
Where
Where is the test taking place? Will there be any distractions?
When
What is the time and day of the test? Will it be in the morning, afternoon, or evening?
Why
Why is this person being tested and how is this person going to be tested?
"Internal structure refers to the number of dimensions or components within a domain that are represented on the test. For example if a test developer theorized that there were several components of intelligence, one would rightly expect that the resulting test would contain several components of intelligence."
Evidence of the relationships between the test and other performances
"The relationship to other performances refers to the accuracy with which test scores predict performance on the same type of test or other similar tests."
Evidence of convergent and discriminant power
"Convergent power refers to a test's ability to produce scores similar to those produced by other tests of the same ability and skills. Discriminant power refers to a test's ability to produce scores different from those produced by other tests of a different ability or skill."
Evidence of the consequences of testing
"Test's are administered with the expectation that some benefit will be realized either to the test taker or to the organization requiring the test."
The 3 types of Validity are: criteria, construct, content
Making Valid Decisions textbook defines validity as "Valid data that is accurate, reliable, robust, and complete. Accurate data answers the right questions and without errors. Reliable data is consistent across time, examiners, and internally. Complete data has no major holes in the data that might make the data lopsided toward one perspective over another. All four criteria must be present to call the data valid."
My definition of valid data is "Data that tests on it's intended content area. If a test is intended to test on reading comprehension, it should be testing reading comprehension."
I do want to add one little saying from Brother Cloward. He said we will use the acronym VdVo which stands for Valid data, Valid decision. Essentially if you know that the data is valid because ti comes from a valid source, the decision that you make regarding the test results will be valid as well.
"Validity has everything to do with the congruence to the standard." - Bro Cloward AKA to be valid it must correlate or align with the standard.
Which is why when I am in my practicums it is such a big deal to align my lesson plan to a state standard to make my lesson plan valid. As in to make sure that my lesson plan meets and aligns with the state standard I chose.