ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	8

Descriptor

Testing Problems	37
Sampling	31
Test Reliability	10
Equated Scores	9
Statistical Analysis	9
Test Construction	9
Test Items	9
Item Sampling	8
Achievement Tests	7
Elementary Secondary Education	7
Test Validity	7
Educational Assessment	6
Item Analysis	6
Educational Testing	5
Error of Measurement	5
Evaluation Methods	5
College Entrance Examinations	4
Data Analysis	4
Foreign Countries	4
Mathematical Models	4
Standardized Tests	4
Test Bias	4
Test Interpretation	4
Testing	4
Testing Programs	4
More ▼

Source

Applied Measurement in…	2
Journal of Educational…	2
Council of the Great City…	1
Educational Measurement:…	1
Journal of Experimental…	1
Journal of School Psychology	1
North American Chapter of the…	1
Online Submission	1
Psychometrika	1
Sage Research Methods Cases	1
Studies in Educational…	1
World Journal of Education	1
More ▼

Publication Type

Reports - Research	37
Journal Articles	10
Speeches/Meeting Papers	10
Books	1
Guides - Non-Classroom	1
Information Analyses	1
Non-Print Media	1
Opinion Papers	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
Secondary Education	2
High Schools	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

Ireland (Dublin)	1
Nigeria	1
United States	1
West Germany	1

Laws, Policies, & Programs

Elementary and Secondary…	1
Emergency School Aid Act 1972	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Perkins Loan Program	1

Assessments and Surveys

National Assessment of…	3
Armed Services Vocational…	1
California Achievement Tests	1
Graduate Record Examinations	1
Myers Briggs Type Indicator	1
SAT (College Admission Test)	1
Stanford Binet Intelligence…	1
Test of English as a Foreign…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Scale Development: Identifying and Addressing Potential Validity Threats Linked with Online Piloting Using Paid-For Samples. Sage Research Methods: Doing Research Online

Direct link

Zita Lysaght; Michael O'Leary; Angela Mazzone; Conor Scully – Sage Research Methods Cases, 2022

Since 2018, colleagues from two research centers at Dublin City University have been collaborating to develop a measurement scale to assess individuals' ability to identify workplace bullying. Having agreed on an operational definition of the construct, an item pool of 26 workplace bullying scenarios, that is, short descriptions of…

Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability

High School Students' Misconceptions about Significance Testing with a Repeated Sampling Approach = Dificultades de estudiantes de bachillerato sobre pruebas de significación a través de un enfoque de muestreo repetido

Peer reviewed
PDF on ERIC

Download full text

Sánchez Sánchez, Ernesto; García Rios, Víctor N.; Silvestre Castro, Eleazar; Licea, Guadalupe Carrasco – North American Chapter of the International Group for the Psychology of Mathematics Education, 2020

In this paper, we address the following questions: What misconceptions do high school students exhibit in their first encounter with significance test problems through a repeated sampling approach? Which theory or framework could explain the presence and features of such patterns? With brief prior instruction on the use of Fathom software to…

Descriptors: High School Students, Misconceptions, Statistical Significance, Testing

Investigating Repeater Effects on Small Sample Equating: Include or Exclude?

Peer reviewed

Direct link

Diao, Hongyu; Keller, Lisa – Applied Measurement in Education, 2020

Examinees who attempt the same test multiple times are often referred to as "repeaters." Previous studies suggested that repeaters should be excluded from the total sample before equating because repeater groups are distinguishable from non-repeater groups. In addition, repeaters might memorize anchor items, causing item drift under a…

Descriptors: Licensing Examinations (Professions), College Entrance Examinations, Repetition, Testing Problems

Counselling Strategies for Curbing Examination Malpractices in Secondary Schools in Enugu State, Nigeria

Peer reviewed
PDF on ERIC

Download full text

Egbo, Anthonia Chinonyelum – World Journal of Education, 2015

This study investigated the Counselling strategies for curbing "Examination Malpractices" in Secondary Schools in Enugu State Nigeria. The researcher used three research questions. The Design used was a descriptive survey design. Sample consisted of 335 respondents comprising principals (N = 19), PTA secretaries (N = 19), teachers (N =…

Descriptors: Counseling Techniques, Questionnaires, Foreign Countries, Surveys

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Investigating Effect of Ignoring Hierarchical Data Structures on Accuracy of Vertical Scaling Using Mixed-Effects Rasch Model

Download full text

Wang, Shudong; Jiao, Hong; Jin, Ying; Thum, Yeow Meng – Online Submission, 2010

The vertical scales of large-scale achievement tests created by using item response theory (IRT) models are mostly based on cluster (or correlated) educational data in which students usually are clustered in certain groups or settings (classrooms or schools). While such application directly violated assumption of independent sample of person in…

Descriptors: Scaling, Achievement Tests, Data Analysis, Item Response Theory

Student Testing in America's Great City Schools: An Inventory and Preliminary Analysis

Download full text

Hart, Ray; Casserly, Michael; Uzzell, Renata; Palacios, Moses; Corcoran, Amanda; Spurgeon, Liz – Council of the Great City Schools, 2015

There has been little data collected on how much testing actually goes on in America's schools and how the results are used. So in the Spring of 2014, the Council staff developed and launched a survey of assessment practices. This report presents the findings from that survey and subsequent Council analysis and review of the data. It also offers…

Descriptors: Urban Schools, Student Evaluation, Testing Programs, Testing

A Note on Allocating Items to Subtests in Multiple Matrix Sampling.

Download full text

Shoemaker, David M. – 1972

Investigated empirically through post mortem item-examinee sampling were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. The results indicate clearly that a partially balanced incomplete block…

Descriptors: Error of Measurement, Item Sampling, Matrices, Sampling

Exact Tests for the Rasch Model via Sequential Importance Sampling

Peer reviewed

Direct link

Chen, Yuguo; Small, Dylan – Psychometrika, 2005

Rasch proposed an exact conditional inference approach to testing his model but never implemented it because it involves the calculation of a complicated probability. This paper furthers Rasch's approach by (1) providing an efficient Monte Carlo methodology for accurately approximating the required probability and (2) illustrating the usefulness…

Descriptors: Testing Problems, Probability, Methods, Testing

Why Should All Those Students Take All Those Tests? (Every-Student Testing or Sampling of Selected Groups?).

Download full text

National Education Association, Washington, DC. – 1975

The National Education Association's Task Force on Testing has stated its opinion that standardized tests are overused. The task force suggests that the application of sampling techniques and a variety of alternatives to current testing practices would accomplish the same purposes. Representatives of the testing industry have indicated that the…

Descriptors: Accountability, Alternative Assessment, Cost Effectiveness, Educational Testing

Minimizing Context Effect When Using Multiple Matrix Sampling.

Download full text

Hill, Richard K. – 1975

This study is an a priori demonstration of the applicability of multiple matrix sampling techniques to the practical research problem of parameter estimation. Three tests were administered to two separate but parallel populations, with one receiving item samples and the other receiving full tests. Special efforts were made to minimize the context…

Descriptors: Bias, Item Sampling, Matrices, Standardized Tests

The Myers-Briggs Type Indicator: Analysis of Discrepancy Score Phenomenon in a Real World Sample.

Download full text

Hoover, Randy L.; Kadunc, Nancy – 1983

The purpose of this paper is to examine the nature of discrepancy score phenomena of the Myers-Briggs Type Indicator (MBTI), as related to internal consistency and construct validity of the instrument. Data were collected from 140 university research managers. The data suggest internal consistency problems: only 37.3 percent of the subjects…

Descriptors: Adults, Personality Measures, Personality Traits, Sampling

The Standard Error of Equipercentile Equating.

Download full text

Lord, Frederic M. – 1981

Transformations or equating of raw test scores on two or more forms of the same test are made interchangeable by empirical procedures deriving the standard error of an equipercentile equating for four different situations. Some numerical results are checked by Monte Carlo methods. Numerical standard errors are computed for two sets of real data.…

Descriptors: Educational Testing, Equated Scores, Error of Measurement, Mathematical Formulas

Item Pool Construction for Use With Latent Trait Models.

PDF pending restoration

Reckase, Mark D. – 1979

Because latent trait models require that large numbers of items be calibrated or that testing of the same large group be repeated, item parameter estimates are often obtained by administering separate tests to different groups and "linking" the results to construct an adequate item pool. Four issues were studied, based upon the analysis…

Descriptors: Achievement Tests, High Schools, Item Banks, Mathematical Models

Previous Page | Next Page »

Pages: 1 | 2 | 3

Wilcox, Rand R.	2
Angela Mazzone	1
Angoff, William H.	1
Askegaard, Lewis D.	1
Boser, Judith A.	1
Boyd, Thomas A.	1
Cahen, Leonard S.	1
Casserly, Michael	1
Chen, Yuguo	1
Conor Scully	1
Corcoran, Amanda	1
Cowell, William R.	1
Diao, Hongyu	1
Doron, Rina	1
Egbo, Anthonia Chinonyelum	1
Flanagan, John C.	1
García Rios, Víctor N.	1
Hart, Ray	1
Hicks, Marilyn M.	1
Hill, Richard K.	1
Hoover, Randy L.	1
Ingels, Steven J.	1
Jiao, Hong	1
Jin, Ying	1
More ▼