ERIC - Search Results

Showing 17,206 to 17,220 of 27,122 results

The Probability of Obtaining Two Statistically Different Test Scores as a Test Index

Peer reviewed

Muller, Jorg M. – Educational and Psychological Measurement, 2006

A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…

Descriptors: Test Reliability, Probability, Scores, Item Response Theory

Encouraging and Supporting Compliance with Standards for Educational Tests

Peer reviewed

Wise, Lauress L. – Educational Measurement: Issues and Practice, 2006

Uses and consequences of educational testing have increased dramatically in recent years. Professional standards to ensure fair treatment of all affected by test results are more important than ever, but standards for developing and using educational tests are only helpful if they are followed. Test developers and users each have a role to play in…

Descriptors: Educational Testing, Standards, Accountability, Cooperation

Metacognitive Experiences: The Missing Link in the Self-Regulated Learning Process--A Rejoinder to Ainley and Patrick

Peer reviewed

Efklides, Anastasia – Educational Psychology Review, 2006

The measurement of online self-regulation processes is a very important issue and in this rejoinder to Ainley and Patrick (this issue) I am arguing that including measures of metacognitive experiences, in conjunction with measures of other affective experiences, in various phases of task processing can increase the reliability and validity of…

Descriptors: Metacognition, Learning Processes, Reader Response, Self Management

A Note on the Interpretation of Weighted Kappa and its Relations to Other Rater Agreement Statistics for Metric Scales

Peer reviewed

Schuster, Christof – Educational and Psychological Measurement, 2004

This article presents a formula for weighted kappa in terms of rater means, rater variances, and the rater covariance that is particularly helpful in emphasizing that weighted kappa is an absolute agreement measure in the sense that it is sensitive to differences in raters' marginal distributions. Specifically, rater mean differences will decrease…

Descriptors: Computation, Rating Scales, Interrater Reliability, Statistical Analysis

A Reliability Induction and Reliability Generalization Study of the CAGE Questionnaire

Peer reviewed

Shields, Alan L.; Caruso, John C. – Educational and Psychological Measurement, 2004

The CAGE is a commonly used alcohol screening instrument. Although considerable work has been done on the validity of CAGE scores, relatively little information is available on their reliability. Reliability induction and generalization studies were performed for the CAGE. Of the 259 studies available for analysis, only 19 (7.3%) contained…

Descriptors: Logical Thinking, Generalization, Test Reliability, Questionnaires

Performance of SIBTEST When the Percentage of DIF Items Is Large

Peer reviewed

Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004

Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However, in some testing situations, like test translation and…

Descriptors: True Scores, Simulation, Test Bias, Student Evaluation

Objectivity, Reliability, and Validity of the Bent-Knee Push-Up for College-Age Women

Peer reviewed

Wood, Heather M.; Baumgartner, Ted A. – Measurement in Physical Education and Exercise Science, 2004

The revised push-up test has been found to have good validity but it produces many zero scores for women. Maybe there should be an alternative to the revised push-up test for college-age women. The purpose of this study was to determine the objectivity, reliability, and validity for the bent-knee push-up test (executed on hands and knees) for…

Descriptors: Body Weight, Athletics, Females, Predictive Validity

Have Disfluency-Type Measures Contributed to the Understanding and Treatment of Developmental Stuttering?

Peer reviewed

Einarsdottir, Johanna; Ingham, Roger J. – American Journal of Speech-Language Pathology, 2005

Purpose: This article critically reviews evidence to determine whether the use of disfluency typologies, such as "syllable repetitions" or "prolongations", has assisted the understanding or treatment of developmental stuttering. Consideration is given to whether there is a need for a fundamental shift in the basis for constructing measures of…

Descriptors: Stuttering, Measures (Individuals), Evidence, Test Reliability

Reliability as a Function of the Number of Item Options Derived from the "Knowledge or Random Guessing" Model

Peer reviewed

MacCann, Robert G. – Psychometrika, 2004

For (0, 1) scored multiple-choice tests, a formula giving test reliability as a function of the number of item options is derived, assuming the "knowledge or random guessing model," the parallelism of the new and old tests (apart from the guessing probability), and the assumptions of classical test theory. It is shown that the formula is a more…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Test Theory

Reactive Attachment Disorder in Maltreated Toddlers

Peer reviewed

Zeanah, Charles H.; Scheeringa, Michael; Boris, Neil W.; Heller, Sherryl S.; Smyke, Anna T.; Trapani, Jennifer – Child Abuse & Neglect: The International Journal, 2004

Objective: To determine if Reactive Attachment Disorder (RAD) can be reliably identified in maltreated toddlers in foster care, if the two types of RAD are independent, and to estimate the prevalence of RAD in these maltreated toddlers. Methods: Clinicians treating 94 maltreated toddlers in foster care were interviewed regarding signs of…

Descriptors: Attachment Behavior, Behavior Disorders, Toddlers, Child Abuse

Validation of a Parent Outcome Questionnaire from Pediatric Cochlear Implantation

Peer reviewed

Nunes, Terezinha; Pretzlik, Ursula; Ilicak, Selin – Journal of Deaf Studies and Deaf Education, 2005

This paper analyzes the reliability and validity of a questionnaire designed by Archbold, Lutman, Gregory, O'Neill, and Nikolopoulos (2002) for the assessment of pediatric cochlear implantation. Parents of 61 youngsters (age range 5 to 16 years), who had the implant for at least 3 years, responded to the questionnaire and to an interview. The alpha…

Descriptors: Questionnaires, Pediatrics, Assistive Technology, Reliability

Reliability in Content Analysis: Some Common Misconceptions and Recommendations

Peer reviewed

Krippendorff, Klaus – Human Communication Research, 2004

In a recent article in this journal, Lombard, Snyder-Duch, and Bracken (2002) surveyed 200 content analyses for their reporting of reliability tests, compared the virtues and drawbacks of five popular reliability measures, and proposed guidelines and standards for their use. Their discussion revealed that numerous misconceptions circulate in the…

Descriptors: Misconceptions, Content Analysis, News Reporting, Measurement Techniques

Diagnostic Assessment of Asperger's Disorder: A Review of Five Third-Party Rating Scales

Peer reviewed

Campbell, Jonathan M. – Journal of Autism and Developmental Disorders, 2005

Five rating scales for screening and detection of Asperger's Disorder, three commercially available and two research instruments, are evaluated with reference to psychometric criteria outlined by Bracken in 1987 ("Journal of Psychoeducational Assessment," 4, 313). Reliability and validity data reported in examiner's manuals or published reports…

Descriptors: Diagnostic Tests, Asperger Syndrome, Rating Scales, Clinical Diagnosis

The Psychometric Properties of the Vineland Adaptive Behavior Scales in Children and Adolescents with Mental Retardation

Peer reviewed

de Bildt, Annelies; Kraijer, Dirk; Sytema, Sjoerd; Minderaa, Ruud – Journal of Autism and Developmental Disorders, 2005

The psychometric properties of the Vineland Adaptive Behavior Scales Survey Form were studied in a total population of children and adolescents with MR, and in the specific levels of functioning (n=826, age 4-18 years). The original division into (sub)domains, as assigned by the authors, was replicated in the total population and in the mild and…

Descriptors: Psychometrics, Measures (Individuals), Children, Adolescents

A Reliability Generalization Study of the Self-Description Questionnaire

Peer reviewed

Leach, Lesley F.; Henson, Robin K.; Odom, Leslie R.; Cagle, Lynne S. – Educational and Psychological Measurement, 2006

The use of reliability generalization methodology promises to, among other things, inform researchers about the importance of reporting reliability coefficients and their use in result interpretation. This study presents results from a reliability generalization study of the Self-Description Questionnaire (SDQ). The average score reliabilities…

Descriptors: Reliability, Questionnaires, Research Methodology, Scores
