ERIC - Search Results

Publication Date

In 2026	3
Since 2025	675
Since 2022 (last 5 years)	3176
Since 2017 (last 10 years)	7417
Since 2007 (last 20 years)	15055

Descriptor

Test Reliability	15043
Test Validity	10279
Reliability	9761
Foreign Countries	7144
Test Construction	4825
Validity	4191
Measures (Individuals)	3877
Factor Analysis	3825
Psychometrics	3526
Interrater Reliability	3124
Correlation	3040
Evaluation Methods	2746
Statistical Analysis	2533
Higher Education	2515
Questionnaires	2473
Scores	2386
College Students	2211
Student Attitudes	2148
Comparative Analysis	1943
Factor Structure	1822
Student Evaluation	1695
Rating Scales	1623
Measurement Techniques	1562
Test Items	1528
Construct Validity	1498
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	19242
Reports - Research	17430
Reports - Evaluative	3328
Speeches/Meeting Papers	1861
Tests/Questionnaires	1598
Reports - Descriptive	1544
Information Analyses	958
Dissertations/Theses -…	673
Opinion Papers	645
Guides - Non-Classroom	325
Numerical/Quantitative Data	252
Books	135
Guides - Classroom - Teacher	81
Reports - General	70
Guides - General	57
Reference Materials -…	53
Collected Works - General	40
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Non-Print Media	22
Dissertations/Theses	21
ERIC Digests in Full Text	20
More ▼

Education Level

Higher Education	4726
Postsecondary Education	3740
Secondary Education	2273
Elementary Education	2197
High Schools	1085
Middle Schools	1033
Elementary Secondary Education	876
Early Childhood Education	874
Junior High Schools	715
Primary Education	427
Intermediate Grades	401
Preschool Education	385
Grade 5	342
Grade 8	325
Grade 4	318
Grade 6	299
Grade 7	279
Grade 3	270
Kindergarten	267
Adult Education	211
Grade 1	202
Grade 2	173
Grade 9	154
Grade 10	140
Grade 11	109
More ▼

Audience

Researchers	709
Practitioners	451
Teachers	208
Administrators	122
Policymakers	66
Counselors	42
Students	38
Parents	11
Community	7
Support Staff	6
Media Staff	5
More ▼

Location

Turkey	1328
Australia	436
Canada	379
China	368
United States	271
United Kingdom	256
Indonesia	253
Taiwan	234
Netherlands	223
Spain	217
California	215
Germany	197
United Kingdom (England)	192
Malaysia	170
Hong Kong	161
Florida	159
Iran	156
Nigeria	149
South Korea	135
Texas	134
India	127
New York	119
Pennsylvania	114
South Africa	109
Japan	106
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 15,601 to 15,615 of 27,107 results Save | Export

Selecting and Evaluating Standardized Reading Tests (Test Review).

Peer reviewed

Lewandowski, Lawrence J.; Martens, Brian K. – Journal of Reading, 1990

Provides an approach for selecting and evaluating both group and individually administered standardized tests of reading. Reviews considerations of the quality of test development; test content; test reliability and validity; and concerns of cost and time investment. Presents sample ratings of two common instruments. (RS)

Descriptors: Reading Tests, Secondary Education, Standardized Tests, Test Reliability

A Consideration of the Validity and Reliability of Suicide Mortality Data.

Peer reviewed

O'Carroll, Patrick W. – Suicide and Life-Threatening Behavior, 1989

Briefly outlines problems associated with definition and official certification of suicide and reviews literature pertaining to validity and reliability of suicide statistics. Considers process of suicide certification as a test, estimating its sensitivity, specificity, and predictive value, using data from studies reviewed. (NB)

Descriptors: Attrition (Research Studies), Death, Evaluation Problems, Reliability

Interjudge Agreement and the Maximum Value of Kappa.

Peer reviewed

Umesh, U. N.; And Others – Educational and Psychological Measurement, 1989

An approach is provided for calculating maximum values of the Kappa statistic of J. Cohen (1960) as a function of observed agreement proportions between evaluators. Separate calculations are required for different matrix sizes and observed agreement levels. (SLD)

Descriptors: Equations (Mathematics), Evaluators, Heuristics, Interrater Reliability

Stability Reliability of the Behavior Rating Profile.

Peer reviewed

Ellers, Robert A.; And Others – Journal of School Psychology, 1989

Examined test-retest stability of Behavior Rating Profile for students grades l-12 (N=198), parents (N=212), and teachers (N=176) on 3 norm-referenced scales. Found Teacher Rating scale reliable across all grades for screening and eligibility, Parent Rating scale reliable for Grade 3-12 screening and Grade 3-6,ll, and l2, eligibility. Found…

Descriptors: Behavior Rating Scales, Elementary Secondary Education, Special Education, Test Reliability

Some Comments on the Relation between Reliability and Statistical Power.

Peer reviewed

Humphreys, Lloyd G.; Drasgow, Fritz – Applied Psychological Measurement, 1989

Issues arising from difference scores with zero reliability that nevertheless allow a powerful test of change are discussed. Issues include the appropriateness of underlying statistical models for psychological data and the relationship between difference scores and power. Increases in reliability always increase power for a fixed effect size.…

Descriptors: Goodness of Fit, Mathematical Models, Power (Statistics), Psychometrics

Consistency of Rasch Model Parameter Estimation: A Simulation Study.

Peer reviewed

van den Wollenberg, Arnold L.; And Others – Applied Psychological Measurement, 1988

The unconditional--simultaneous--maximum likelihood (UML) estimation procedure for the one-parameter logistic model produces biased estimators. The UML method is inconsistent and is not a good alternative to conditional maximum likelihood method, at least with small numbers of items. The minimum Chi-square estimation procedure produces unbiased…

Descriptors: Computer Simulation, Estimation (Mathematics), Maximum Likelihood Statistics, Reliability

Introduction to the Structure and Application of the Stanford-Binet Intelligence Scale-Fourth Edition.

Peer reviewed

Glutting, Joseph J. – Journal of School Psychology, 1989

Introduces Stanford-Binet Intelligence Scale-Fourth Edition (SB4) as an attempt to revitalize Stanford-Binet by maintaining links with previous editions while simultaneously incorporating more recent developments found in other popular tests of intelligence. Discusses the SB4's theoretical foundation, materials and administration, scaling,…

Descriptors: Intelligence Tests, Models, Test Reliability, Test Use

Impact of Measurement Error on Statistical Power: Review of an Old Paradox.

Peer reviewed

Williams, Richard H.; And Others – Journal of Experimental Education, 1995

The paradox that a Student t-test based on pretest-posttest differences can attain its greatest power when the difference score reliability is zero was explained by demonstrating that power is not a mathematical function of reliability unless either true score variance or error score variance is constant. (SLD)

Descriptors: Error of Measurement, Power (Statistics), Pretests Posttests, Reliability

The Reliability of Observational Data: II. Issues in the Identification and Measurement of Stuttering Events.

Peer reviewed

Cordes, Anne K.; Ingham, Roger J. – Journal of Speech and Hearing Research, 1994

This paper reviews the prominent concepts of the stuttering event and concerns about the reliability of stuttering event measurements, specifically interjudge agreement. Recent attempts to resolve the stuttering measurement problem are reviewed, and the implications of developing an improved measurement system are discussed. (Author/JDD)

Descriptors: Data Collection, Interrater Reliability, Measurement Techniques, Observation

AB with Multiple Wells: 1. Why Are Multiple Wells Sometimes Easier than Two Wells? 2. Memory or Memory + Inhibition?

Peer reviewed

Diamond, Adele; And Others – Developmental Psychology, 1994

Found that faulty test procedures may explain why infants sometimes locate hidden objects more easily in multiple-well tests than in two-well trials. Also found that errors in seven-well tests were not evenly distributed but occurred disproportionately in the direction of the previously correct well, suggesting that memory and inhibition are both…

Descriptors: Infants, Inhibition, Memory, Recall (Psychology)

The Consistency of Peer Review in Student Writing Projects.

Peer reviewed

Marcoulides, George A.; Simkin, Mark G. – Journal of Education for Business, 1995

Each paper written by 60 sophomores in computer classes received 3 peer evaluations using a structured evaluation process. Overall, students were able to grade efficiently and consistently in terms of overall score and selected criteria (subject matter, content, and mechanics). (SK)

Descriptors: Higher Education, Interrater Reliability, Peer Evaluation, Undergraduate Students

Psychological Adjustment of Children with Sickle Cell Disease: Stability and Change over a 10-Month Period.

Peer reviewed

Thompson, Robert J.; And Others – Journal of Consulting and Clinical Psychology, 1994

Describes investigation utilizing sickle cell disease subjects from a stress and coping project. Found little stability in classification of individuals' adjustment, low congruence in behavior problem patterns and diagnoses, and less stability in adjustment by child report than mother report. Suggests children's coping strategies are intervention…

Descriptors: Children, Classification, Coping, Preadolescents

Inter-rater and Intra-rater Reliability of the Occupational Therapy Diagnosis.

Peer reviewed

Driessen, Marie-Jose; And Others – Occupational Therapy Journal of Research, 1995

Two occupational therapists in an interrater test and 9 in an intrarater test used a form based on the International Classification of Impairments, Disabilities, and Handicaps to evaluate 50 patients in a psychiatric hospital and 50 in a rehabilitation center. Based on percentage of agreement and Cohen's kappa, the reliability of the diagnoses was…

Descriptors: Clinical Diagnosis, Disabilities, Interrater Reliability, Occupational Therapy

Use of "Vague" Quantifiers in Measuring Communication Behaviors.

Peer reviewed

Kennamer, J. David – Journalism Quarterly, 1992

Investigates the use of "vague quantifiers" (terms such as "often,""sometimes,""rarely," or "never") in communication research. Finds that these words do not always mean the same thing to different people, and thus may not constitute interval scales. Suggests that research outcomes based upon such…

Descriptors: Communication Research, Higher Education, Research Methodology, Research Problems

Gender Differences in the Structure of Interests.

Peer reviewed

Hansen, Jo-Ida C.; And Others – Journal of Vocational Behavior, 1993

Multidimensional scaling was applied to Women-in-General (n=300) and Men-in-General (n=300) samples of the Strong Interest Inventory. Participants were matched on occupational title, obtaining two-dimensional solutions that demonstrated gender differences in the underlying structure of vocational interests. (SK)

Descriptors: Interest Inventories, Multidimensional Scaling, Sex Differences, Test Reliability

« Previous Page | Next Page »

Pages: 1 | ... | 1037 | 1038 | 1039 | 1040 | 1041 | 1042 | 1043 | 1044 | 1045 | ... | 1808

Educational and Psychological…	810
ProQuest LLC	659
Journal of Psychoeducational…	399
Online Submission	327
Journal of Educational…	252
Journal of Autism and…	233
Psychology in the Schools	232
Measurement and Evaluation in…	230
Grantee Submission	184
Psychological Assessment	180
Journal of Speech, Language,…	174
Measurement in Physical…	170
Applied Psychological…	149
Assessment for Effective…	138
International Journal of…	134
Journal of Consulting and…	131
Educational Research and…	130
Assessment & Evaluation in…	124
Language Testing	120
Psychometrika	120
Research on Social Work…	120
Educational Sciences: Theory…	119
Applied Measurement in…	111
International Journal of…	110
ETS Research Report Series	106
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	44
Race to the Top	27
Elementary and Secondary…	20
Every Student Succeeds Act…	20
Elementary and Secondary…	16
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Head Start	5
Education Consolidation…	4
Education for All Handicapped…	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	176
Peabody Picture Vocabulary…	88
SAT (College Admission Test)	86
Test of English as a Foreign…	82
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	66
Program for International…	62
Child Behavior Checklist	59
National Assessment of…	56
ACT Assessment	52
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
Beck Depression Inventory	50
Autism Diagnostic Observation…	47
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	45
Motivated Strategies for…	43
Raven Progressive Matrices	43
Behavior Assessment System…	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Vineland Adaptive Behavior…	39
Kaufman Assessment Battery…	38
More ▼