Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 3 |
Descriptor
Scaling | 10 |
Scoring Formulas | 10 |
Item Analysis | 4 |
Test Construction | 4 |
Test Items | 4 |
Error of Measurement | 3 |
Scoring | 3 |
Criterion Referenced Tests | 2 |
Difficulty Level | 2 |
Field Tests | 2 |
Higher Education | 2 |
More ▼ |
Source
American Educational Research… | 1 |
ETS Research Report Series | 1 |
Language Assessment Quarterly | 1 |
Language Testing | 1 |
National Center for Education… | 1 |
School Science and Mathematics | 1 |
Author
Amsbary, Michelle | 1 |
Annis, Terri | 1 |
Attali, Yigal | 1 |
Baer, Justin | 1 |
Baldi, Stephane, Ed. | 1 |
Berlin, Martha | 1 |
Bernstein, Jared | 1 |
Binzer, Greg | 1 |
Clark, Lyn | 1 |
Dunleavy, Eric | 1 |
Forsyth, Barbara | 1 |
More ▼ |
Publication Type
Reports - Research | 6 |
Journal Articles | 5 |
Reports - Evaluative | 2 |
Speeches/Meeting Papers | 2 |
Numerical/Quantitative Data | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Secondary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Location
Japan | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of Adult… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Baldi, Stephane, Ed.; Kutner, Mark; Greenberg, Elizabeth; Jin, Ying; Baer, Justin; Moore, Elizabeth; Dunleavy, Eric; Berlin, Martha; Mohadjer, Leyla; Binzer, Greg; Krenzke, Thomas; Hogan, Jacqueline; Amsbary, Michelle; Forsyth, Barbara; Clark, Lyn; Annis, Terri; Bernstein, Jared; White, Sheida – National Center for Education Statistics, 2009
The 2003 National Assessment of Adult Literacy (NAAL) assessed the English literacy skills of a nationally representative sample of more than 19,000 U.S. adults (age 16 and older) residing in households and correctional institutions. NAAL is the first national assessment of adult literacy since the 1992 National Adult Literacy Survey (NALS). The…
Descriptors: Correctional Institutions, Scaling, Numeracy, Field Tests

Oltman, Phillip K.; Stricker, Lawrence J. – Language Testing, 1990
A recent multidimensional scaling analysis of the Test of English-as-a-Foreign-Language (TOEFL) item response data identified clusters of items in the test sections that, being more homogeneous than their parent sections, might be better for diagnostic use. The analysis was repeated using different scoring techniques. Results diverged only for…
Descriptors: English (Second Language), Item Analysis, Language Tests, Scaling

Rutledge, Michael L.; Warden, Melissa A. – School Science and Mathematics, 1999
Describes the development and validation of the Measure of Acceptance of the Theory of Evolution (MATE), a 20-item, Likert-scaled instrument that assesses teachers' overall acceptance of evolutionary theory. (Author/CCM)
Descriptors: Evolution, Higher Education, Mathematics Education, Scaling
Attali, Yigal – ETS Research Report Series, 2007
Because there is no commonly accepted view of what makes for good writing, automated essay scoring (AES) ideally should be able to accommodate different theoretical positions, certainly at the level of state standards but also perhaps among teachers at the classroom level. This paper presents a practical approach and an interactive computer…
Descriptors: Computer Assisted Testing, Automation, Essay Tests, Scoring

McGaw, Barry; Glass, Gene V. – American Educational Research Journal, 1980
There are difficulties in expressing effect sizes on a common metric when some studies use transformed scales to express group differences, or use factorial designs or covariance adjustments to obtain a reduced error term. A common metric on which effect sizes may be standardized is described. (Author/RL)
Descriptors: Control Groups, Error of Measurement, Mathematical Models, Research Problems
Hendrickson, Gerry F.; Green, Bert F., Jr. – 1972
It has been shown that Guttman weighting of test options results in marked increases in the internal consistency of a test. However, the effect of this type of weighting on the structure of the test is not known. Hence, the purpose of this study is to compare the factor structure of Guttman-weighted and rights-only-weighted tests and to relate the…
Descriptors: Analysis of Variance, Correlation, Factor Analysis, Item Analysis
Jaeger, Richard M. – 1980
Five statistical indices are developed and described which may be used for determining (1) when linear equating of two approximately parallel tests is adequate, and (2) whan a more complex method such as equipercentile equating must be used. The indices were based on: (1) similarity of cumulative score distributions; (2) shape of the raw-score to…
Descriptors: College Entrance Examinations, Difficulty Level, Equated Scores, Higher Education
Hambleton, Ronald K.; Novick, Melvin R. – 1972
In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…
Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling
Legg, Sue M. – 1982
A case study of the Florida Teacher Certification Examination (FTCE) program was described to assist others launching the development of large scale item banks. FTCE has four subtests: Mathematics, Reading, Writing, and Professional Education. Rasch calibrated item banks have been developed for all subtests except Writing. The methods used to…
Descriptors: Cutting Scores, Difficulty Level, Field Tests, Item Analysis