ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	7

Descriptor

Psychometrics	13
Scores	13
Testing Problems	13
Test Interpretation	5
Test Validity	5
Measurement Techniques	4
Test Items	4
Test Reliability	4
Elementary Secondary Education	3
Error of Measurement	3
Standardized Tests	3
Comparative Analysis	2
Difficulty Level	2
Educational Assessment	2
Evaluation Methods	2
Evaluation Problems	2
Evaluation Research	2
Foreign Countries	2
Latent Trait Theory	2
Measurement	2
Test Construction	2
Testing	2
Academic Ability	1
Academic Achievement	1
Achievement Tests	1
More ▼

Source

Applied Measurement in…	1
ETS Research Report Series	1
International Journal of…	1
Journal of Computer Assisted…	1
Journal of Educational…	1
Journal of Educational and…	1
Learning Disability Quarterly	1
Measurement:…	1
Psychology in the Schools	1
Research in Education	1

Publication Type

Journal Articles	10
Reports - Research	8
Reports - Descriptive	2
Reports - Evaluative	2
Speeches/Meeting Papers	2
Information Analyses	1
Opinion Papers	1

Education Level

Elementary Education	2
Elementary Secondary Education	2
Early Childhood Education	1
Junior High Schools	1
Preschool Education	1
Secondary Education	1

Audience

Location

Japan	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
New Jersey College Basic…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Measurement Invariance of Scores on the Teacher Stress Scale: International Sample of PreK-12 Teachers

Peer reviewed

Direct link

Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025

Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…

Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Assessing Mode Effects of At-Home Testing without a Randomized Trial. Research Report. ETS RR-21-10

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021

In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…

Descriptors: Testing, Distance Education, Comparative Analysis, Test Items

Digital-First Assessments: A Security Framework

Peer reviewed

Direct link

LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022

Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…

Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering

The Leading Group Effect: Illusionary Declines in Scholastic Standard Scores of Mid-Range Japanese Junior High School Pupils

Peer reviewed

Direct link

Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012

Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…

Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems

Toward Developmental Trajectories: A Commentary on "Assessing Measures of Mathematical Knowledge for Teaching"

Peer reviewed

Direct link

Kulikowich, Jonna M. – Measurement: Interdisciplinary Research and Perspectives, 2007

Operating from multiple literature bases in cognitive psychology, mathematics education, and theoretical and applied psychometrics, Schilling, Hill and their colleagues provide a systemic approach to studying the validity of scores of mathematical knowledge for teaching. This system encompasses an array of task formats and methodologies. The…

Descriptors: Multiple Choice Tests, Learning Theories, Teaching Methods, Construct Validity

Specialists' Use of Tests and Clinical Judgment in the Diagnosis of Learning Disabilities.

Peer reviewed

Davis, W. Alan; Shepard, Lorrie A. – Learning Disability Quarterly, 1983

The tests used in the identification of learning disabilitis and the extent of knowledge specialists have regarding the technical aspects of tests were assessed. All groups of specialists tended to overrate the tests they used, and generally indicated a lack of familiarity with the psychometric properties of commonly used tests. (Author/SW)

Descriptors: Clinical Diagnosis, Diagnostic Tests, Disability Identification, Knowledge Level

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Direct link

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Psychometric Issues in Testing Students with Disabilities.

Peer reviewed

Geisinger, Kurt F. – Applied Measurement in Education, 1994

Federal law requires that individuals with handicapping conditions be administered assessments in ways that accommodate their disabilities without penalizing them. Validation studies are needed to evaluate the meaning of scores resulting from nonstandard test administrations. The limited number of these studies to date is reviewed. (SLD)

Descriptors: Disabilities, Educational Assessment, Elementary School Students, Elementary Secondary Education

Determining Optimal Test Lengths with a Fixed Total Testing Time.

Download full text

Hambleton, Ronald K. – 1986

The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…

Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics

A Comparison of Rasch Person Analysis and Robust Estimators.

Smith, Richard M. – 1983

Measurement disturbances, such as guessing, startup, and plodding, often result in an examinee's ability being either over- or under-estimated by the maximum likelihood estimation employed in latent trait psychometric models. Several authors have suggested methods to lessen the impact of unexpected responses on the ability estimation process. This…

Descriptors: Difficulty Level, Error of Measurement, Estimation (Mathematics), Goodness of Fit

A New Approach to Avoiding Problems of Scale in Interpreting Trends in Mental Measurement Data.

Peer reviewed

Braun, Henry I. – Journal of Educational Measurement, 1988

A new approach to the quantification and interpretation of change when only repeated cross-sectional data are available--the Trajectory Analysis of Matched Percentiles--is presented. A recent attempt to interpret the findings on reading achievement of the National Assessment of Educational Progress is critically analyzed. (SLD)

Descriptors: Academic Achievement, Achievement Tests, Data Collection, Data Interpretation

Test Fairness Is a Personal Issue!

Smith, Richard M. – 1983

Previous studies of test item bias have investigated how different groups of examinees perform differently on a given set of items. These studies imply that examinees should be treated in a certain way because they are of a particular sex or race rather than as individuals in their own right, but it is unrealistic and unfair to assume such an…

Descriptors: Academic Ability, Error of Measurement, Error Patterns, Higher Education

Smith, Richard M.	2
Attali, Yigal	1
Baig, Basim	1
Braun, Henry I.	1
Chen, Yunxiao	1
Davis, W. Alan	1
Foster, Jeff L.	1
Geisinger, Kurt F.	1
Hambleton, Ronald K.	1
Horie, André Kenji	1
Jiayi Wang	1
Kim, Sooyeon	1
Kulikowich, Jonna M.	1
LaFlair, Geoffrey T.	1
Langenfeld, Thomas	1
Lee, Yi-Hsuan	1
Li, Xiaoou	1
Meyer, Kevin D.	1
Michael T. Kalkbrenner	1
Mori, Kazuo	1
Riley Schaner	1
Shepard, Lorrie A.	1
Uchida, Akitoshi	1
Walker, Michael	1
von Davier, Alina A.	1
More ▼