ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Evaluation Methods	24
Test Items	24
Test Use	24
Test Construction	16
Student Evaluation	9
Test Validity	8
Educational Assessment	6
Elementary Secondary Education	6
Test Reliability	6
Foreign Countries	5
Test Bias	5
Test Content	5
Test Interpretation	5
Achievement Tests	4
Guidelines	4
Psychometrics	4
Scores	4
Scoring	4
Standardized Tests	4
Test Results	4
Academic Standards	3
Adaptive Testing	3
Computer Assisted Testing	3
Educational Improvement	3
Educational Objectives	3
More ▼

Source

Journal of Educational…	2
Adolescence	1
Applied Measurement in…	1
Assessment and Accountability…	1
Center for Assessment and…	1
ETS Research Report Series	1
Education Policy Analysis…	1
Educational Horizons	1
International Journal of…	1
Ministerial Council on…	1
Psychology in the Schools	1
More ▼

Publication Type

Journal Articles	9
Reports - Research	6
Speeches/Meeting Papers	6
Reports - Descriptive	5
Reports - Evaluative	5
Tests/Questionnaires	3
Guides - Classroom - Teacher	2
Opinion Papers	2
Book/Product Reviews	1
Books	1
Guides - Classroom - Learner	1
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Elementary Secondary Education	3
Elementary Education	1
Grade 6	1

Audience

Practitioners	4
Teachers	2
Community	1
Parents	1
Students	1

Location

Australia	1
South Korea	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	1
Pennsylvania Educational…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Direct link

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

A Modified "a"-Stratified Method for Computerized Adaptive Testing. Research Report. ETS RR-19-10

Peer reviewed
PDF on ERIC

Download full text

Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019

Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high a items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

Direct link

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Counterbalance Assessment: The Chorizo Test

Peer reviewed

Direct link

Cabrera, Nolan L.; Cabrera, George A. – Educational Horizons, 2011

Just like all the high-stakes tests that determine students' futures nowadays, The Chorizo Test is a standardized test rooted in the culture of the test makers. It was originally created to be used with students in teacher training programs to sensitize them to the pitfalls inherent in standardized pencil-and-paper tests, such as linguistic bias…

Descriptors: Test Use, Standardized Tests, Social Sciences, High Stakes Tests

Benchmark Assessment for Improved Learning. AACC Report

Download full text

Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010

This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…

Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment

Guidelines for Computerized-Adaptive Test Development and Use in Education [Book Review].

Peer reviewed

Eignor, Daniel R. – Journal of Educational Measurement, 1997

The authors of the "Guidelines," a task force of eight, intend to present an organized list of features to be considered in reporting or evaluating computerized-adaptive assessments. Apart from a few weaknesses, the book is a useful and complete document that will be very helpful to test developers. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Evaluation Methods, Guidelines

An Assessment Instrument for Identifying Counseling Needs of Elementary-Aged Students: The Multimodal Sentence Completion Form for Children (MSCF-C).

Peer reviewed

Gamble, Charles W.; Hamblin, Arthur G. – Psychology in the Schools, 1986

Discusses the use of a sentence completion instrument predicated on Lazarus' multimodal system. The instrument, entitled The Multimodal Sentence Completion Form for Children (MSCF-C), is designed to systematically assess client needs and assist in identifying intervention strategies. Presents a case study of a 12-year-old, sixth-grade student.…

Descriptors: Case Studies, Counseling, Elementary Education, Elementary School Students

Using Objective Tests To Evaluate.

Download full text

Parsons, Jim; Fenwick, Tara – 1999

This "toolbox" offers suggestions about how and when to create objective tests. Such tests are sometimes a quick way to find out how students are doing, and sometimes they help students focus on what they are doing in class or help teachers define the content that is worth knowing. The following suggestions are offered for developing objective…

Descriptors: Elementary Secondary Education, Evaluation Methods, Foreign Countries, Objective Tests

Using Multidimensional Item Response Theory to Understand What Items and Tests Are Measuring.

Peer reviewed

Ackerman, Terry A. – Applied Measurement in Education, 1994

When item response data do not satisfy the unidimensionality assumption, multidimensional item response theory (MIRT) should be used to model the item-examinee interaction. This article presents and discusses MIRT analyses designed to give better insight into what individual items are measuring. (SLD)

Descriptors: Evaluation Methods, Item Response Theory, Measurement Techniques, Models

Methods of Assessing Bias and Fairness in Tests.

Merz, William R. – 1980

Several methods of assessing test item bias are described, and the concept of fair use of tests is examined. A test item is biased if individuals of equal ability have different probabilities of attaining the item correct. The following seven general procedures used to examine test items for bias are summarized and discussed: (1) analysis of…

Descriptors: Comparative Analysis, Evaluation Methods, Factor Analysis, Mathematical Models

Test Bias and the Culturally Different Early Adolescent.

Peer reviewed

Roberts, Eileen; DeBlassie, Richard R. – Adolescence, 1983

Defines test bias as a phenomenon in which test scores result in negative outcomes for certain groups, often lower socioeconomic groups and minorities. Discusses three manifestations of test bias including content, atmosphere, and use bias and presents recommendations for remedying bias problems in testing the culturally different. (JAC)

Descriptors: Adolescents, Cultural Differences, Evaluation Methods, Intelligence Tests

Simulation Based Discovery Environments and Acquisition, the Features, and Assessment of Intuitive Knowledge.

Swaak, Janine; And Others – 1997

A study was conducted to develop a test that is able to capture knowledge of an intuitive nature, such as that acquired through discovery learning. The proposed test format is called the "what-if test." Test items in this format consist of the presentation of a situation. A change in the situation is introduced, and learners have to…

Descriptors: College Students, Discovery Learning, Educational Assessment, Evaluation Methods

What Parents Should Know about Test Accuracy and Use. Assessment Brief. Number 4

Download full text

Dietel, Ron – Center for Assessment and Evaluation of Student Learning (CAESL) at WestEd, 2004

The accuracy and fairness of standardized testing is taken very seriously in the education world. These issues are a major focus of both the testing experts who develop standardized tests and the researchers who endeavor to ensure a test's fairness, reliability, validity, and accuracy. But many issues remain both controversial and complex. The…

Descriptors: Testing, Standardized Tests, Test Items, Test Bias

Should Achievement Tests Be Used To Judge School Quality?

Peer reviewed

Bauer, Scott C. – Education Policy Analysis Archives, 2000

Studied whether student scores on standardized tests represent reasonable measures of instructional quality using ratings by 10 parents and 11 educators (school principals) of the degree to which test items from a nationally marketed standardized achievement test represent the content actually taught. On average, raters felt that test items…

Descriptors: Achievement Tests, Educational Quality, Elementary Secondary Education, Evaluation Methods

Basic Precepts in Test Construction.

Download full text

Buser, Karen – 1996

Most seasoned test developers recognize the importance of thoughtful decision making when constructing a test. Unfortunately, many classroom achievement tests are created by novice test developed who have not received sufficient instruction in item writing (G. Gulliksen, 1986; R. J. Stiggins, 1991). The result is often a test that is poorly…

Descriptors: Achievement Tests, Decision Making, Educational Planning, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2

Eignor, Daniel R.	2
Ackerman, Terry A.	1
Bauer, Scott C.	1
Bishop, Laurence A.	1
Buser, Karen	1
Cabrera, George A.	1
Cabrera, Nolan L.	1
DeBlassie, Richard R.	1
Dietel, Ron	1
Dietel, Ronald	1
Donovan, Jenny	1
Fenwick, Tara	1
Gamble, Charles W.	1
Gredler, Margaret E.	1
Gu, Lixiong	1
Hambleton, Ronald K.	1
Hamblin, Arthur G.	1
Herman, Joan	1
Herman, Joan L.	1
Hutton, Penny	1
Kang, Gyenam Kim	1
Kylie Gorney	1
Lee, Yeounwoo	1
Lennon, Melissa	1
More ▼