ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Descriptor

Statistical Analysis	40
Testing Problems	40
Test Validity	35
Test Construction	13
Test Reliability	13
Test Bias	11
Item Analysis	9
Scores	9
Measurement Techniques	7
Testing	7
Achievement Tests	6
Correlation	6
Elementary Secondary Education	6
Multiple Choice Tests	6
Test Interpretation	6
Test Items	6
Research Methodology	5
Statistical Studies	5
Culture Fair Tests	4
Data Analysis	4
Evaluation Criteria	4
Language Tests	4
Mathematical Models	4
Predictive Validity	4
Psychometrics	4
More ▼

Source

Didakometry	1
ETS Research Report Series	1
Education and Urban Society	1
Educational and Psychological…	1
Journal of Educational…	1
Journal of Educational…	1
NCME Measurement in Education	1
Psychometrika	1

Publication Type

Reports - Research	23
Speeches/Meeting Papers	10
Journal Articles	3
Reports - Evaluative	3
Information Analyses	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Opinion Papers	1
Reference Materials -…	1
Tests/Questionnaires	1

Education Level

Audience

Researchers

Location

Netherlands	2
California (Stanford)	1
Canada	1
China	1
Colorado (Denver)	1
Minnesota	1
Sweden	1

Laws, Policies, & Programs

Assessments and Surveys

General Aptitude Test Battery	2
Armed Services Vocational…	1
Metropolitan Achievement Tests	1
Metropolitan Readiness Tests	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 40 results Save | Export

Preparing for the Speaking Tasks of the "TOEFL iBT"® Test: An Investigation of the Journeys of Chinese Test Takers. "TOEFL iBT"® Research Report. TOEFL iBT-28. ETS Research Report. RR-17-19

Peer reviewed
PDF on ERIC

Download full text

Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017

Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language

A Statistical Procedure for Assessing Test Dimensionality. Measurement Series 84-2.

Stout, William – 1984

An important problem in psychological test theory is the development of a sound method for determining whether a test which purports to measure the level of a certain ability is, in reality, significantly contaminated by one or more other abilities displayed by persons taking the test. Because of the large number of private and governmental…

Descriptors: Latent Trait Theory, Statistical Analysis, Statistical Distributions, Test Validity

ADEQUACY OF TEST VALIDITIES FOR INDIVIDUAL PREDICTION.

Download full text

WEITZ, HENRY – 1967

COUNSELORS OFTEN ADMINISTER TESTS OF QUESTIONABLE VALIDITY. IN RELIABILITY STUDIES, EVERY PRECAUTION IS TAKEN TO STABILIZE THE STIMULUS SITUATION. IN ASSESSING VALIDITY, CONCERN CENTERS ON BEHAVIOR UNDER DIFFERENT STIMULUS CONDITIONS. CRONBACH'S THEORETICAL LIMIT FOR A VALIDITY COEFFICIENT OF A TEST IS THE SQUARE ROOT OF THE RELIABILITY…

Descriptors: Aptitude Tests, Career Counseling, Counseling, Counseling Objectives

A Quick Method for Determining Test Bias

Peer reviewed

Echternacht, Gary – Educational and Psychological Measurement, 1974

Descriptors: Evaluation Criteria, Probability, Statistical Analysis, Test Bias

A Comparison Among Person-Fit Measures.

Frary, Robert B. – 1982

Three measures of person-fit (the extent to which an examinee's response pattern on a multiple-choice test is consistent with his ability as estimated by total score) were computed for students taking classroom tests under 12 different instructors at a comprehensive university. Supplementary questions on each test inquired concerning students'…

Descriptors: Higher Education, Multiple Choice Tests, Predictive Validity, Reliability

Selection Bias: Multiple Meanings.

Peer reviewed

Linn, Robert L. – Journal of Educational Measurement, 1984

The common approach to studies of predictive bias is analyzed within the context of a conceptual model in which predictors and criterion measures are viewed as fallible indicators of idealized qualifications. (Author/PN)

Descriptors: Certification, Models, Predictive Measurement, Predictive Validity

Conditional Correlation Phenomena with Applications to University Admission Strategies.

Peer reviewed

Akemann, Charles A.; And Others – Journal of Educational Statistics, 1983

Generally, this paper aims to: (1) provide clarification, quantification, and some mathematical analysis to the statistical problem of restricted range in a college admissions situation; and (2) discuss various questions related to the problem of selection strategies. (Author/PN)

Descriptors: Admission Criteria, College Admission, Correlation, Higher Education

How to Tell if a Test Measures the Same Thing in Different Cultures.

Download full text

Frederiksen, Norman – 1976

A number of different ways of ascertaining whether or not a test measures the same thing in different cultures are examined. Methods range from some that are obvious and simple to those requiring statistical and psychological sophistication. Simpler methods include such things as having candidates "think aloud" and interviewing them about how they…

Descriptors: Analysis of Covariance, Culture Fair Tests, Factor Analysis, Item Analysis

GATB: Does the Apparatus Make a Difference?

Download full text

Kapes, Jerome T. – 1975

Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus.) As part of a ten-year longitudinal study of Vocational Development being conducted in the Department of Vocational Education at The…

Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores

Frequency Words and Frequencies: A Pilot Study on Relations Between Differently Anchored Scales. Didakometry; No. 44, November 1974.

Download full text

Larsson, Bernt – Didakometry, 1974

Subjects are asked to answer six questions, partly with a frequency and partly by marking a verbally anchored scale with five categories. Some univariate and multivariate analyses are performed to elucidate the relations between variables with the two different modes of response. Although there are similarities in results for the two types of…

Descriptors: Measurement Techniques, Measures (Individuals), Rating Scales, Responses

Fairness in Educational Achievement Testing

Peer reviewed

Tittle, Carol Kehr – Education and Urban Society, 1975

The purpose of this paper is to describe a set of procedures, that, when carried out, permit the conclusion that a test is a fair measure from the standpoint of specific sub-groups within a test population. A fair test is defined as a test for which a set of data-collection procedures have been carried out and the results reported. (Author/JM)

Descriptors: Academic Achievement, Achievement Tests, Evaluation Criteria, Measurement Techniques

Evaluation Design Project: Multilevel Interpretation of Evaluation Data Study.

Download full text

Miller, M. David; Burstein, Leigh – 1981

Two studies are presented in this report. The first is titled "Empirical Studies of Multilevel Approaches to Test Development and Interpretation: Measuring Between-Group Differences in Instruction." Because of a belief that schooling does affect student achievement, researchers have questioned the empirical and measurement techniques…

Descriptors: Error Patterns, Evaluation Methods, Item Analysis, Models

A Model for Assessing the Effects of Departures from Reality in Performance Testing.

Download full text

Morse, David T.; Morse, Linda W. – 1976

Performance testing often entails the usage of expensive, time-consuming measures in the quest for determining the level of performance on some desired behavior. It is concluded that a generalizability theory approach to dealing with departures from reality in testing can aid in the establishment of empirically-based choices of measurement…

Descriptors: Cost Effectiveness, Decision Making, Mathematical Models, Measurement Techniques

Reducing Bias in Achievement Tests.

Download full text

Green, Donald Ross – 1976

During the past few years the problem of bias in testing has become an increasingly important issue. In most research, bias refers to the fair use of tests and has thus been defined in terms of an outside criterion measure of the performance being predicted by the test. Recently however, there has been growing interest in assessing bias when such…

Descriptors: Achievement Tests, Item Analysis, Mathematical Models, Minority Groups

Test-Wiseness Cues in the Options of Mathematics Items.

Kuntz, Patricia – 1982

The quality of mathematics multiple choice items and their susceptibility to test wiseness were examined. Test wiseness was defined as "a subject's capacity to utilize the characteristics and formats of the test and/or test taking situation to receive a high score." The study used results of the Graduate Record Examinations Aptitude Test (GRE) and…

Descriptors: Cues, Item Analysis, Multiple Choice Tests, Psychometrics

Previous Page | Next Page »

Pages: 1 | 2 | 3

Hurley, Christine	2
Spicuzza, Richard	2
Thurlow, Martha	2
ANDRADE, MANUEL	1
Akemann, Charles A.	1
Barker, Pierce	1
Bleistein, Carole A.	1
Bormuth, John R.	1
Broussard, Rolland L.	1
Burstein, Leigh	1
Ebel, Robert L.	1
Echternacht, Gary	1
El Sawaf, Hamdy	1
Erickson, Ronald	1
Fang, Lin	1
Frary, Robert B.	1
Frederiksen, Norman	1
Gordon, Howard R. D.	1
Green, Donald Ross	1
Hambleton, Ronald K.	1
He, Lianzhen	1
Hendrickson, Gerry F.	1
Isaac, Stephen	1
More ▼