ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	6

Descriptor

Psychometrics	27
Testing Problems	27
Test Construction	10
Educational Assessment	9
Performance Based Assessment	8
Test Validity	7
Elementary Secondary Education	6
Evaluation Methods	6
Test Items	6
Test Reliability	6
Achievement Tests	5
Measurement Techniques	5
Scoring	5
Simulation	5
Standardized Tests	5
Cognitive Tests	4
Computer Assisted Testing	4
Diagnostic Tests	4
Disabilities	4
Test Bias	4
Test Use	4
Academic Achievement	3
Alternative Assessment	3
Educational Change	3
Evaluation Problems	3
More ▼

Source

Journal of Educational…	4
Applied Measurement in…	2
Educational Measurement:…	2
Applied Psychological…	1
Educational Researcher	1
Educational and Psychological…	1
Hispanic Journal of…	1
Intelligence	1
Journal of Autism and…	1
Journal of Early Intervention	1
Learning Disability Quarterly	1
Mathematics Teacher Education…	1
Review of Research in…	1
More ▼

Publication Type

Reports - Evaluative	27
Journal Articles	18
Speeches/Meeting Papers	5
Information Analyses	3
Reports - Research	2
Books	1
Opinion Papers	1

Education Level

Elementary Secondary Education

Audience

Researchers

Location

United Kingdom	1
United States	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

National Assessment of…	2
Armed Services Vocational…	1
Battelle Developmental…	1
Kaufman Assessment Battery…	1
Minnesota Multiphasic…	1
SAT (College Admission Test)	1
Texas Assessment of Academic…	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Peer reviewed

Direct link

Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020

Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…

Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics

Critical Issues in Specific Learning Disability Identification: What We Need to Know about the PSW Model

Peer reviewed

Direct link

McGill, Ryan J.; Styck, Kara M.; Palomares, Ronald S.; Hass, Michael R. – Learning Disability Quarterly, 2016

As a result of the upcoming Federal reauthorization of the Individuals With Disabilities Education Improvement Act (IDEA), practitioners and researchers have begun vigorously debating what constitutes evidence-based assessment for the identification of specific learning disability (SLD). This debate has resulted in strong support for a method that…

Descriptors: Learning Disabilities, Disability Identification, Disabilities, Federal Legislation

Challenges and Strategies for Assessing Specialised Knowledge for Teaching

Peer reviewed
PDF on ERIC

Download full text

Orrill, Chandra Hawley; Kim, Ok-Kyeong; Peters, Susan A.; Lischka, Alyson E.; Jong, Cindy; Sanchez, Wendy B.; Eli, Jennifer A. – Mathematics Teacher Education and Development, 2015

Developing and writing assessment items that measure teachers' knowledge is an intricate and complex undertaking. In this paper, we begin with an overview of what is known about measuring teacher knowledge. We then highlight the challenges inherent in creating assessment items that focus specifically on measuring teachers' specialised knowledge…

Descriptors: Specialization, Knowledge Base for Teaching, Educational Strategies, Testing Problems

Impact of Diagnosticity on the Adequacy of Models for Cognitive Diagnosis under a Linear Attribute Structure: A Simulation Study

Peer reviewed

Direct link

de La Torre, Jimmy; Karelitz, Tzur M. – Journal of Educational Measurement, 2009

Compared to unidimensional item response models (IRMs), cognitive diagnostic models (CDMs) based on latent classes represent examinees' knowledge and item requirements using discrete structures. This study systematically examines the viability of retrofitting CDMs to IRM-based data with a linear attribute structure. The study utilizes a procedure…

Descriptors: Simulation, Item Response Theory, Psychometrics, Evaluation Methods

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Assessing the Functions of Aberrant Behaviors: A Review of Psychometric Instruments.

Peer reviewed

Sturmey, Peter – Journal of Autism and Developmental Disorders, 1994

This paper reviews the psychometric properties, treatment utility, and conceptual basis of instruments used to identify the functions of aberrant behaviors in people with developmental disabilities. Instruments include the Motivational Assessment Scale, Motivation Analysis Rating Scale, Functional Analysis Interview Form, and Functional Analysis…

Descriptors: Behavior Problems, Developmental Disabilities, Evaluation Methods, Motivation

The Accuracy of Three Developmental Screening Tests.

Peer reviewed

Glascoe, Frances Page; Byrne, Karen E. – Journal of Early Intervention, 1993

The accuracy of 3 developmental screening tests administered to 89 young children was compared. The Battelle Developmental Inventory Screening Test was more accurate than the Academic Scale of the Developmental Profile-II and the Denver-II, identifying correctly 72% of children with difficulties and 76% of children without diagnoses. (Author/JDD)

Descriptors: Child Development, Disabilities, Disability Identification, Early Identification

Traditional Dimensionality vs. Essential Dimensionality.

Download full text

Nandakumar, Ratna – 1989

The theoretical differences between the traditional definition of dimensionality and the more recently defined notion of essential dimensionality are presented. Monte Carlo simulations are used to demonstrate the utility of W. F. Stout's procedure to assess the essential unidimensionality of the latent space underlying a set of terms. The…

Descriptors: Definitions, Educational Assessment, Latent Trait Theory, Mathematical Models

Two Weak Spots in the Practice of Criterion-referenced Measurement.

Peer reviewed

Linn, Robert L. – Educational Measurement: Issues and Practice, 1982

Confusion in the terminology used in criterion-referenced measurement specifications and development and standard setting and the attendant role of cut-off scores are shown to need practical clarification through psychometric research on test applications and consequences. (CM)

Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Measurement Objectives

Sequential Reliability Tests.

Peer reviewed

Eiting, Mindert H. – Applied Psychological Measurement, 1991

A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)

Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models

Toward a Psychometrics for Testlets.

Peer reviewed

Wainer, Howard; Lewis, Charles – Journal of Educational Measurement, 1990

Three different applications of the testlet concept are presented, and the psychometric models most suitable for each application are described. Difficulties that testlets can help overcome include (1) context effects; (2) item ordering; and (3) content balancing. Implications for test construction are discussed. (SLD)

Descriptors: Algorithms, Computer Assisted Testing, Elementary Secondary Education, Item Response Theory

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Direct link

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

Practical Issues in Large-Scale High-Stakes Computerized Adaptive Testing.

PDF pending restoration

Mills, Craig N.; Stocking, Martha L. – 1995

Computerized adaptive testing (CAT), while well-grounded in psychometric theory, has had few large-scale applications for high-stakes, secure tests in the past. This is now changing as the cost of computing has declined rapidly. As is always true where theory is translated into practice, many practical issues arise. This paper discusses a number…

Descriptors: Adaptive Testing, Computer Assisted Testing, High Stakes Tests, Item Banks

Overview of the Most Difficult Technical Issues on the VNT.

Download full text

Skaggs, Gary; Bourque, Mary Lyn – 1998

Political and legislative pressures have posed a number of measurement issues and challenges to the development of sound, valid voluntary national tests (VNTs). This paper focuses on what appear to be the most difficult technical issues related to the VNT proposed by President Clinton in 1997. Technical issues refer to psychometric issues, as…

Descriptors: Academic Achievement, Achievement Tests, Classification, Difficulty Level

Psychometric Inadequacies of the TAAS.

Peer reviewed

Bernal, Ernesto M. – Hispanic Journal of Behavioral Sciences, 2000

Examines some problems of the Texas Assessment of Academic Skills (TAAS): multiple cutoff scores for passing the test and receiving a high school diploma, and artificially "tricky" items that disproportionately confuse language-minority students. Uses factor analysis and simulations to show ways to improve item selection for the TAAS.…

Descriptors: Black Students, Construct Validity, Elementary Secondary Education, Factor Analysis

Previous Page | Next Page »

Pages: 1 | 2

Austin, J. Sue	1
Bernal, Ernesto M.	1
Bourque, Mary Lyn	1
Braun, Henry I.	1
Burstein, Leigh	1
Byrne, Karen E.	1
Cui, Ying	1
Eiting, Mindert H.	1
Eli, Jennifer A.	1
Figueroa, Richard A.	1
Geisinger, Kurt F.	1
Glascoe, Frances Page	1
Grabovsky, Irina	1
Hambleton, Ronald K.	1
Hass, Michael R.	1
Jones, Marshall B.	1
Jong, Cindy	1
Kahl, Stuart R.	1
Karelitz, Tzur M.	1
Kim, Ok-Kyeong	1
Lance, Charles E.	1
Leighton, Jacqueline P.	1
Leventhal, Brian C.	1
Lewis, Charles	1
More ▼