Showing 1 to 15 of 21 results
Peer reviewed
Zu, Jiyun; Puhan, Gautam – Journal of Educational Measurement, 2014
Preequating is in demand because it reduces score reporting time. In this article, we evaluated an observed-score preequating method: the empirical item characteristic curve (EICC) method, which makes preequating without item response theory (IRT) possible. EICC preequating results were compared with a criterion equating and with IRT true-score…
Descriptors: Item Response Theory, Equated Scores, Item Analysis, Item Sampling
Peer reviewed
Fitzpatrick, Anne R. – Educational Measurement: Issues and Practice, 2008
Examined in this study were the effects of reducing anchor test length on student proficiency rates for 12 multiple-choice tests administered in an annual, large-scale, high-stakes assessment. The anchor tests contained 15, 10, or 5 items. Five content-representative samples of items were drawn at each anchor test length from a…
Descriptors: Test Length, Multiple Choice Tests, Item Sampling, Student Evaluation
Peer reviewed
Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient alpha and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower-bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
OECD Publishing (NJ1), 2012
The "PISA 2009 Technical Report" describes the methodology underlying the PISA 2009 survey. It examines additional features related to the implementation of the project at a level of detail that allows researchers to understand and replicate its analyses. The reader will find a wealth of information on the test and sample design,…
Descriptors: Quality Control, Research Reports, Research Methodology, Evaluation Criteria
Peer reviewed
Rudd, Andy; Johnson, R. Burke – Studies in Educational Evaluation, 2008
As a result of the federal No Child Left Behind Act (NCLB) of 2002, the field of education has seen a heavy emphasis on the use of "scientifically based research" for designing and testing the effectiveness of new and existing educational programs. According to NCLB, when addressing basic cause and effect questions scientifically based…
Descriptors: Quasiexperimental Design, Scientific Research, Educational Research, Federal Legislation
Peer reviewed
Webster, Jeffrey Dean – International Journal of Aging and Human Development, 2007
This study examined the psychosocial correlates and psychometric properties of the Self-Assessed Wisdom Scale (SAWS) (Webster, 2003a). Seventy-three men and 98 women ranging in age from 17 to 92 years (mean age = 42.77) completed an expanded, 40-item version of the SAWS, the Loyola Generativity Scale, and the Experiences in Close Relationships Scale…
Descriptors: Measures (Individuals), Psychometrics, Construct Validity, Correlation
Peer reviewed
Huitzing, Hiddo A.; Veldkamp, Bernard P.; Verschoor, Angela J. – Journal of Educational Measurement, 2005
Several techniques exist to automatically put together a test meeting a number of specifications. In an item bank, the items are stored with their characteristics. A test is constructed by selecting a set of items that fulfills the specifications set by the test assembler. Test assembly problems are often formulated in terms of a model consisting…
Descriptors: Testing Programs, Programming, Mathematics, Item Sampling
Berger, Martijn P. F. – 1989
The problem of obtaining designs that result in the most precise parameter estimates is encountered in at least two situations where item response theory (IRT) models are used. In so-called two-stage testing procedures, certain designs that match difficulty levels of the test items with the ability of the examinees may be located. Such designs…
Descriptors: Difficulty Level, Efficiency, Equations (Mathematics), Heuristics
Peer reviewed
Taylor, Annette Kujawski – College Student Journal, 2005
This research examined 2 elements of multiple-choice test construction: balancing the key and the optimal number of options. In Experiment 1 the 3 conditions included a balanced key, overrepresentation of a and b responses, and overrepresentation of c and d responses. The results showed that error patterns were independent of the key, reflecting…
Descriptors: Comparative Analysis, Test Items, Multiple Choice Tests, Test Construction
Peer reviewed
Handel, Richard W.; Arnau, Randolph C.; Archer, Robert P.; Dandy, Kristina L. – Assessment, 2006
The Minnesota Multiphasic Personality Inventory--Adolescent (MMPI-A) and Minnesota Multiphasic Personality Inventory--2 (MMPI-2) True Response Inconsistency (TRIN) scales are measures of acquiescence and nonacquiescence included among the standard validity scales on these instruments. The goals of this study were to evaluate the effectiveness of…
Descriptors: Adolescents, Protocol Analysis, Effect Size, Personality Measures
van den Brink, Wulfert – Evaluation in Education: International Progress, 1982
Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques
Linn, Robert – 1978
A series of studies on conceptual and design problems in competency-based measurement is described. The concept of validity within the context of criterion-referenced measurement is reviewed. The authors believe validation should be viewed as a process rather than an end product. It is the process of marshalling evidence to support…
Descriptors: Criterion Referenced Tests, Item Analysis, Item Sampling, Test Bias
Upp, Caroline M.; Barcikowski, Robert S. – 1981
Demands for more complete information on educational programs have emanated from national, state and local sources. Their focus is on the processes that are occurring in individual classrooms. The information that is collected to provide insight into educational programs is customarily summative in nature, answering, for example, questions…
Descriptors: Academic Achievement, Attitude Measures, Cognitive Measurement, Evaluation Methods
Peer reviewed
Cliff, Norman; Donoghue, John R. – Psychometrika, 1992
A test theory using only ordinal assumptions is presented, based on the idea that the test items are a sample from a universe of items. The sum across items of the ordinal relations for a pair of persons on the universe items is analogous to a true score. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Response Theory, Item Sampling
Stake, Bernadine Evans; And Others – 1983
During the last 2 years (1980-82), selected schools in the Broward County School District in Florida participated in the National Sex Equity Demonstration Project (NSEDP) to create a model for demonstration of curricular materials, educational practices, and program arrangements that feature gender-fair instruction and associated educational…
Descriptors: Administrator Attitudes, Demonstration Programs, Elementary Secondary Education, Evaluation Methods
Pages: 1  |  2