ERIC - Search Results

Showing 1 to 15 of 25 results

Validation of a Standardized Multiple-Choice Multicultural Competence Test: Implications for Training, Assessment, and Practice

Peer reviewed

Gillem, Angela R.; Bartoli, Eleonora; Bertsch, Kristin N.; McCarthy, Maureen A.; Constant, Kerra; Marrero-Meisky, Sheila; Robbins, Steven J.; Bellamy, Scarlett – Journal of Multicultural Counseling and Development, 2016

The Multicultural Counseling and Psychotherapy Test (MCPT), a measure of multicultural counseling competence (MCC), was validated in 2 phases. In Phase 1, the authors administered 451 test items derived from multicultural guidelines in counseling and psychology to 32 multicultural experts and 30 nonexperts. In Phase 2, the authors administered the…

Descriptors: Counseling Techniques, Cultural Relevance, Counselor Qualifications, Expertise

A Review of Standardizing an English Second Language Test-Item through Reverse Engineering

Peer reviewed

Foghahaee, Zahra – Language Teaching Research Quarterly, 2019

Reverse engineering (RE) can play an important role in redesigning tests in L2 English. It can also enrich the aim of teaching the same as raising children through academic achievement. In addition, it can play a key role in helping students understand how valid their test is by using Standard reverse engineering (SRE). This paper is a…

Descriptors: Language Tests, Second Language Learning, Second Language Instruction, English (Second Language)

The Cognitive Validity of Child English Language Tests: What Young Language Learners and Their Native-Speaking Peers Can Reveal

Peer reviewed

Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018

This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…

Descriptors: Language Tests, English, English (Second Language), Second Language Learning

Ensuring Validity of Practical English Certification Test of Local Office of Education in Korea

Peer reviewed

Kang, Mun-koo; Chang, Hyung-ji – Journal of Pan-Pacific Association of Applied Linguistics, 2014

This study is aimed at ensuring the validity of the Practical English Certification Test (PECT) of the Chung-nam Office of Education (COE) in Korea. Motivated by the demand for developing a localized English test to empower English learning in public education, the COE conducted the PECT for 38,544 students of elementary, middle and high schools…

Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Language Tests

A Standardized Tool for Assessing the Quality of Classroom-Based Shared Reading: Systematic Assessment of Book Reading (SABR)

Peer reviewed

Pentimonti, Jill M.; Zucker, Tricia A.; Justice, Laura M.; Petscher, Yaacov; Piasta, Shayne B.; Kaderavek, Joan N. – Early Childhood Research Quarterly, 2012

Participation in shared-reading experiences is associated with children's language and literacy outcomes, yet few standardized assessments of shared-reading quality exist. The purpose of this study was to describe the psychometric characteristics of the Systematic Assessment of Book Reading (SABR), an observational tool designed to characterize…

Descriptors: Test Validity, Construct Validity, Interrater Reliability, Factor Structure

A "Conditional" Sense of Fairness in Assessment

Peer reviewed

Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013

Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…

Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics

Item-Level and Construct Evaluation of Early Numeracy Curriculum-Based Measures

Peer reviewed

Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012

The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…

Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten

"Comments on Slavin": Synthesizing Causal Inferences

Peer reviewed

Briggs, Derek C. – Educational Researcher, 2008

When causal inferences are to be synthesized across multiple studies, efforts to establish the magnitude of a causal effect should be balanced by an effort to evaluate the generalizability of the effect. The evaluation of generalizability depends on two factors that are given little attention in current syntheses: construct validity and external…

Descriptors: Test Validity, Construct Validity, Inferences, Educational Policy

Developing a Test of Pragmatics of Japanese as a Foreign Language

Itomitsu, Masayuki – ProQuest LLC, 2009

This dissertation reports development and validation studies of a Web-based standardized test of Japanese as a foreign language (JFL), designed to measure learners' off-line grammatical and pragmatic knowledge in multiple-choice format. Targeting Japanese majors in U.S. universities and colleges, the test is designed to explore possible…

Descriptors: Sentences, Speech Acts, Grammar, Second Language Learning

Background, College Experiences, and the ACT-COMP Exam: Using Construct Validity to Evaluate Assessment Instruments.

Peer reviewed

Pike, Gary R. – Review of Higher Education, 1989

A study investigated the appropriateness of the American College Testing Program's College Outcome Measures Program, conducted at the University of Tennessee, Knoxville, by applying the criterion of construct validity. Results indicated that while the test primarily measures individual differences, it is also sensitive to the effects of higher…

Descriptors: Construct Validity, Educational Quality, Evaluation Criteria, Higher Education

A More Valid Alternative to TOEFL?

Peer reviewed

Roemer, Ann – College and University, 2002

Describes the Test of English as a Foreign Language (TOEFL) and the Advanced Placement in International English Language (APIEL) and evaluates both tests on three basic types of validity criteria: content, construct, and criterion-related. Concludes that the TOEFL has serious limitations, and that the APIEL may be more useful. (EV)

Descriptors: Construct Validity, Content Validity, English (Second Language), Foreign Students

Examining the Construct Validity for the Multiple-Content Testing Programs

Peer reviewed

Li, Yuan H.; Tompkins, Leroy J. – International Journal of Testing, 2004

The primary objective of this study was to examine the construct validity for the 2 multiple-content testing programs, the multiple-choice Comprehensive Tests of Basic Skills (CTBS/5) together with the performance-based Maryland School Performance Assessment Program (MSPAP), by evaluating the true-score longitudinal associations among…

Descriptors: Testing Programs, Structural Equation Models, Performance Based Assessment, Multitrait Multimethod Techniques

Evidence of Construct Validity in Published Achievement Tests.

Nolet, Victor; Tindal, Gerald – 1990

Valid interpretation of test scores is the shared responsibility of the test designer and the test user. Test publishers must provide evidence of the validity of the decisions their tests are intended to support, while test users are responsible for analyzing this evidence and subsequently using the test in the manner indicated by the publisher.…

Descriptors: Achievement Tests, Construct Validity, Elementary Secondary Education, Norm Referenced Tests

Rasch Analysis of the Standardization Data of the Bayley Mental Scale of Infant Development.

Snyder, Scott; Sheehan, Robert – Diagnostique, 1992

Rasch calibration procedures were applied to item-response data for the 1,262 infants and toddlers comprising the standardization sample for the Mental Scale of the Bayley Scales of Infant Development. Analyses tend to confirm the psychometric integrity of the instrument. (Author)

Descriptors: Child Development, Cognitive Tests, Concurrent Validity, Construct Validity

Factorial Validity of the KeyMath-Revised.

Walker, David W.; Arnault, Lynne S. – Diagnostique, 1991

This study examined the construct validity of the KeyMath-Revised by testing the factorial model proposed by the test author. Results failed to confirm the proposed factorial model and suggested that the KeyMath-Revised assesses two domains that are difficult to interpret, rather than the three proposed by the test author. (Author/JDD)

Descriptors: Achievement Tests, Construct Validity, Diagnostic Tests, Elementary Secondary Education
