ERIC - Search Results

Showing 1 to 15 of 19 results

Does MTV Really Do a Good Job of Evaluating Professors? An Empirical Test of the Internet Site Ratemyprofessors.com

Peer reviewed

Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016

Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…

Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis

Validity of the Children's Orientation to Book Reading Rating Scale

Peer reviewed

Kaderavek, Joan N.; Guo, Ying; Justice, Laura M. – Journal of Research in Reading, 2014

The present study investigates the validity of a 4-point rating scale used to measure the level of preschool children's orientation to literacy during shared book reading. Validity was explored by (a) comparing the children's level of literacy orientation as measured with the "Children's Orientation to Book Reading Rating Scale" (COB)…

Descriptors: Reading Habits, Reading Interests, Rating Scales, Test Validity

Not Read, but Nevertheless Solved? Three Experiments on PIRLS Multiple Choice Reading Comprehension Test Items

Peer reviewed

Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012

Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…

Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4

Stability of Rasch Scales over Time

Peer reviewed

Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010

Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…

Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Validation of the TOEFL as a Canadian University Admissions Requirement

Peer reviewed

Simner, Marvin L.; Mitchell, John B. – Canadian Journal of School Psychology, 2007

The Test of English as a Foreign Language (TOEFL) is widely used to screen university applicants for whom English is not their native language. Although the cutoff scores vary, in Ontario those with scores much lower than 550 are rarely admitted to any university. Two exceptions are the University of Western Ontario and its affiliate, Brescia…

Descriptors: Foreign Countries, Program Validation, English (Second Language), Standardized Tests

Use of an Aptitude Test in University Entrance--A Validity Study: Relationships between SAT® Scores, Attainment Measures and Background Variables. Research Report RR846

Kirkup, Catherine; Schagen, Ian; Wheater, Rebecca; Morrison, Jo; Whetton, Chris – National Foundation for Educational Research, 2007

In September 2005 the National Foundation for Educational Research (NFER) in association with the Department for Education and Skills (DfES), the Sutton Trust and the College Board, began a five-year research study to examine the validity of an aptitude test in higher education admissions. This report describes and explores the relationships…

Descriptors: Educational Research, Academic Achievement, Student Surveys, Aptitude Tests

A Comparison of the Fairness of Adaptive and Conventional Testing Strategies. Research Report 78-1.

Pine, Steven M.; Weiss, David J. – 1978

This report examines how selection fairness is influenced by the characteristics of a selection instrument in terms of its distribution of item difficulties, level of item discrimination, degree of item bias, and testing strategy. Computer simulation was used in the administration of either a conventional or Bayesian adaptive ability test to a…

Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Testing, Computer Assisted Testing

Exploratory Studies of Bias in Achievement Tests.

Green, Donald Ross; Draper, John F. – 1972

This paper considers the question of bias in group administered academic achievement tests, bias which is inherent in the instruments themselves. A body of data on the test of performance of three disadvantaged minority groups--northern, urban black; southern, rural black; and, southwestern, Mexican-Americans--as tryout samples in contrast to…

Descriptors: Achievement Tests, Bias, Comparative Testing, Educational Testing

A Comparison of the Relative Efficiency and Validity of Tailored Tests and Conventional Quizzes.

Peer reviewed

Kent, Thomas H.; Albanese, Mark A. – Evaluation and the Health Professions, 1987

Two types of computer-administered unit quizzes in a systematic pathology course for second-year medical students were compared. Quizzes composed of questions selected on the basis of a student's ability had higher correlations with the final examination than did quizzes composed of questions randomly selected from topic areas. (Author/JAZ)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level

Examining the Validity of GED® Tests Scores with Scheduling and Setting Accommodations. GED Testing Service Research Studies, 2004-1

George-Ezzelle, Carol E.; Skaggs, Gary – GED Testing Service, 2004

Current testing standards call for test developers to provide evidence that testing procedures and test scores, and the inferences made based on the test scores, show evidence of validity and are comparable across subpopulations (American Educational Research Association [AERA], American Psychological Association [APA], & National Council on…

Descriptors: Scheduling, Testing Accommodations, Academic Achievement, Test Validity

Item-Option Weighting of Achievement Tests: Comparative Study of Methods.

Peer reviewed

Downey, Ronald G. – Applied Psychological Measurement, 1979

This research attempted to interrelate several methods of producing option weights (i.e., Guttman internal and external weights and judges' weights) and examined their effects on reliability and on concurrent, predictive, and face validity. It was concluded that option weighting offered limited, if any, improvement over unit weighting. (Author/CTM)

Descriptors: Achievement Tests, Answer Keys, Comparative Testing, High Schools

Traditional vs. Retrospective Pretests for Measuring Science Teaching Efficacy Beliefs in Preservice Teachers

Peer reviewed

Cantrell, Pamela – School Science and Mathematics, 2003

The difference in gain scores produced by traditional pretests and those produced by retrospective pretests when compared to posttest scores on the Science Teaching Efficacy Belief Instrument for preservice teachers was investigated in this study. Results indicated that gain scores using the traditional pretest produced significant improvement in…

Descriptors: Pretests Posttests, Validity, Scores, Preservice Teachers

General Report on Edmonton Grade III Achievement. 1956-1977 Comparisons.

Clarke, S. C. T.; And Others – 1977

To compare achievement standards of 1977 with those of 1956, three tests were administered in their original form to all third graders in a large school district. Approximately 3500 students in 1956 and 4500 in 1977 were administered the California Achievement Tests, the California Short Form Tests of Mental Maturity, and the Gates Advanced…

Descriptors: Academic Achievement, Achievement Gains, Achievement Tests, Arithmetic

Language Testing: The Construction and Use of Foreign Language Tests. A Teacher's Book.

Lado, Robert – 1961

Intended as a comprehensive introduction to the construction and use of foreign language tests, this book utilizes modern linguistic knowledge as a base for scientific language testing. Major attention in testing is focused on such integrated language skills as auditory and reading comprehension, speaking, writing, translation, and over-all…

Descriptors: Achievement Tests, Aptitude Tests, Comparative Testing, Cultural Education
