ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	3

Descriptor

Scaling	10
Scoring Formulas	10
Item Analysis	4
Test Construction	4
Test Items	4
Error of Measurement	3
Scoring	3
Criterion Referenced Tests	2
Difficulty Level	2
Field Tests	2
Higher Education	2
Language Tests	2
Statistical Analysis	2
Test Reliability	2
Test Validity	2
Testing Problems	2
Weighted Scores	2
Achievement Tests	1
Adult Literacy	1
Analysis of Variance	1
Automation	1
Benchmarking	1
College Entrance Examinations	1
College Students	1
Comparative Analysis	1
More ▼

Source

American Educational Research…	1
ETS Research Report Series	1
Language Assessment Quarterly	1
Language Testing	1
National Center for Education…	1
School Science and Mathematics	1

Publication Type

Reports - Research	6
Journal Articles	5
Reports - Evaluative	2
Speeches/Meeting Papers	2
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Japan	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of Adult…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Guessing and the Rasch Model

Peer reviewed

Direct link

Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016

Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…

Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests

Technical Report and Data File User's Manual: For the 2003 National Assessment of Adult Literacy. NCES 2009-476

Peer reviewed
PDF on ERIC

Download full text

Baldi, Stephane, Ed.; Kutner, Mark; Greenberg, Elizabeth; Jin, Ying; Baer, Justin; Moore, Elizabeth; Dunleavy, Eric; Berlin, Martha; Mohadjer, Leyla; Binzer, Greg; Krenzke, Thomas; Hogan, Jacqueline; Amsbary, Michelle; Forsyth, Barbara; Clark, Lyn; Annis, Terri; Bernstein, Jared; White, Sheida – National Center for Education Statistics, 2009

The 2003 National Assessment of Adult Literacy (NAAL) assessed the English literacy skills of a nationally representative sample of more than 19,000 U.S. adults (age 16 and older) residing in households and correctional institutions. NAAL is the first national assessment of adult literacy since the 1992 National Adult Literacy Survey (NALS). The…

Descriptors: Correctional Institutions, Scaling, Numeracy, Field Tests

Developing Homogeneous TOEFL Scales by Multidimensional Scaling.

Peer reviewed

Oltman, Phillip K.; Stricker, Lawrence J. – Language Testing, 1990

A recent multidimensional scaling analysis of the Test of English-as-a-Foreign-Language (TOEFL) item response data identified clusters of items in the test sections that, being more homogeneous than their parent sections, might be better for diagnostic use. The analysis was repeated using different scoring techniques. Results diverged only for…

Descriptors: English (Second Language), Item Analysis, Language Tests, Scaling

The Development and Validation of the Measure of Acceptance of the Theory of Evolution Instrument.

Peer reviewed

Rutledge, Michael L.; Warden, Melissa A. – School Science and Mathematics, 1999

Describes the development and validation of the Measure of Acceptance of the Theory of Evolution (MATE), a 20-item, Likert-scaled instrument that assesses teachers' overall acceptance of evolutionary theory. (Author/CCM)

Descriptors: Evolution, Higher Education, Mathematics Education, Scaling

On-the-Fly Customization of Automated Essay Scoring. Research Report. ETS RR-07-42

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal – ETS Research Report Series, 2007

Because there is no commonly accepted view of what makes for good writing, automated essay scoring (AES) ideally should be able to accommodate different theoretical positions, certainly at the level of state standards but also perhaps among teachers at the classroom level. This paper presents a practical approach and an interactive computer…

Descriptors: Computer Assisted Testing, Automation, Essay Tests, Scoring

Choice of the Metric for Effect Size in Meta-analysis.

Peer reviewed

McGaw, Barry; Glass, Gene V. – American Educational Research Journal, 1980

There are difficulties in expressing effect sizes on a common metric when some studies use transformed scales to express group differences, or use factorial designs or covariance adjustments to obtain a reduced error term. A common metric on which effect sizes may be standardized is described. (Author/RL)

Descriptors: Control Groups, Error of Measurement, Mathematical Models, Research Problems

Comparison of the Factor Structure of Guttman-Weighted vs. Rights-Only-Weighted Tests.

Download full text

Hendrickson, Gerry F.; Green, Bert F., Jr. – 1972

It has been shown that Guttman weighting of test options results in marked increases in the internal consistency of a test. However, the effect of this type of weighting on the structure of the test is not known. Hence, the purpose of this study is to compare the factor structure of Guttman-weighted and rights-only-weighted tests and to relate the…

Descriptors: Analysis of Variance, Correlation, Factor Analysis, Item Analysis

Some Exploratory Indices for Selection of a Test Equating Method.

Jaeger, Richard M. – 1980

Five statistical indices are developed and described which may be used for determining (1) when linear equating of two approximately parallel tests is adequate, and (2) whan a more complex method such as equipercentile equating must be used. The indices were based on: (1) similarity of cumulative score distributions; (2) shape of the raw-score to…

Descriptors: College Entrance Examinations, Difficulty Level, Equated Scores, Higher Education

Toward an Integration of Theory and Method for Criterion-Referenced Tests.

Download full text

Hambleton, Ronald K.; Novick, Melvin R. – 1972

In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…

Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling

The Use of Precalibrated Item Bank to Establish and Maintain Cutoff Scores: A Case Study of the Florida Teacher Certification Examination.

Download full text

Legg, Sue M. – 1982

A case study of the Florida Teacher Certification Examination (FTCE) program was described to assist others launching the development of large scale item banks. FTCE has four subtests: Mathematics, Reading, Writing, and Professional Education. Rasch calibrated item banks have been developed for all subtests except Writing. The methods used to…

Descriptors: Cutting Scores, Difficulty Level, Field Tests, Item Analysis

Amsbary, Michelle	1
Annis, Terri	1
Attali, Yigal	1
Baer, Justin	1
Baldi, Stephane, Ed.	1
Berlin, Martha	1
Bernstein, Jared	1
Binzer, Greg	1
Clark, Lyn	1
Dunleavy, Eric	1
Forsyth, Barbara	1
Glass, Gene V.	1
Green, Bert F., Jr.	1
Greenberg, Elizabeth	1
Hambleton, Ronald K.	1
Hendrickson, Gerry F.	1
Hogan, Jacqueline	1
Holster, Trevor A.	1
Jaeger, Richard M.	1
Jin, Ying	1
Krenzke, Thomas	1
Kutner, Mark	1
Lake, J.	1
Legg, Sue M.	1
McGaw, Barry	1
More ▼