Literature Summary: Faking in Personnel Selection

This document summarizes selected papers on faking in personnel selection.

McCrae and Costa (1983)

  • Social desirability (SD) scales are better interpreted as measures of substantive traits than as indicators of response bias
  • Using SD to correct for response bias should be questioned

Anderson, Warner, and Spencer (1984)

  • Inflation bias is prevalent and pervasive in employment selection
  • Inflation bias is negatively correlated with an external performance measure

Hough, Eaton, Dunnette, and Kamp (1990)

  • Validities were in the .20s (uncorrected for unreliability or restriction in range) against targeted criterion constructs
  • Respondents successfully distorted their self-descriptions when instructed to do so
  • Response validity scales were responsive to different types of distortion
  • Applicants’ responses did not reflect evidence of distortion
  • Validities remained stable regardless of possible distortion by respondents in either unusually positive or negative directions

Holden and Kroner (1992)

  • Test item response times were statistically adjusted to reflect item latencies in relation both to the person and to the item
  • Discriminant function analysis indicated that such times could significantly differentiate among standard responding, faking good responses, and faking bad responses
  • Classification hit rates with differential response latencies compared favorably with those found with more traditional response dissimulation scales
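The latency adjustment described above can be sketched as a double standardization: first within each person (across items), then within each item (across persons). This is a minimal sketch under stated assumptions; the function names and the use of z-scores are illustrative and are not taken from the paper, which may have used a different adjustment.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize a list of numbers; return zeros if there is no spread."""
    m = mean(values)
    s = pstdev(values)
    return [(v - m) / s if s > 0 else 0.0 for v in values]


def adjust_latencies(rt):
    """rt[p][i] is the raw response time of person p on item i.

    Returns latencies standardized within person, then within item
    (an assumed, illustrative form of the adjustment).
    """
    # Step 1: remove each person's overall speed and variability.
    within_person = [zscores(row) for row in rt]
    # Step 2: remove each item's typical (person-adjusted) latency.
    n_items = len(rt[0])
    columns = [zscores([row[i] for row in within_person]) for i in range(n_items)]
    # Transpose back to person-by-item layout.
    return [[columns[i][p] for i in range(n_items)] for p in range(len(rt))]


raw = [[1.0, 2.0, 3.0], [2.0, 4.0, 9.0], [5.0, 5.5, 6.0]]
adjusted = adjust_latencies(raw)
```

The adjusted values could then serve as predictors in a classifier such as discriminant function analysis.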

Schmidt and Ryan (1993)

  • Similar factor structures should not be assumed across testing situations
  • In the current study, a five-factor structure fit the student sample but not the applicant sample
  • There is probably an ideal-employee factor in the applicant sample

Barrick and Mount (1996)

  • In two long-haul trucker samples, Conscientiousness (C; $\rho = -.26$ and $-.26$) and Emotional Stability (ES; $\rho = -.23$ and $-.21$) were valid predictors of voluntary turnover
  • C ($\rho = .41$ and $.39$) and ES ($\rho = .23$ and $.27$) were valid predictors of supervisor-rated job performance
  • Applicants did distort their scores on both C and ES scales
  • Distortion occurred both through self-deception and impression management
  • However, neither type of distortion attenuated the predictive validities of either personality construct

Ones, Viswesvaran, and Reiss (1996)

  • Meta-analysis
  • Social desirability scales were found not to predict school success, task performance, counterproductive behaviors, or job performance
  • Social desirability is not as pervasive a problem as has been anticipated by industrial-organizational psychologists
  • Social desirability is in fact related to real individual differences in emotional stability and conscientiousness
  • Social desirability does not function as a predictor, as a practically useful suppressor, or as a mediator variable for the criterion of job performance
  • Removing the effects of social desirability from the Big Five dimensions of personality leaves the criterion-related validity of personality constructs for predicting job performance intact

Zickar and Drasgow (1996)

  • Appropriateness measurement: quantifies the difference between an examinee’s observed pattern of item responses and the responses expected on the basis of that person’s standing on the latent trait $\theta$ and a set of item response functions (IRFs), as specified by some IRT model. IRFs are functions that relate $\theta$ to the probability of affirming an item. An examinee whose pattern of responses differs greatly from the expected pattern will have an extreme appropriateness index
  • The item response theory approach (appropriateness measurement) classified a higher number of faking respondents at low rates of misclassification of honest respondents (false positives) than did a social desirability scale
  • At higher false positive rates, the social desirability approach did slightly better
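To make the idea of an appropriateness index concrete, here is a minimal sketch of one widely used person-fit statistic, the standardized log-likelihood $l_z$, under a two-parameter logistic (2PL) model. Both the choice of $l_z$ and the 2PL model are assumptions made for illustration; the notes above do not say which index or IRT model the authors used, and the item parameters below are invented.

```python
import math


def irf_2pl(theta, a, b):
    """2PL item response function: probability of affirming the item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def lz_index(responses, theta, items):
    """Standardized log-likelihood person-fit index.

    responses: list of 0/1 item responses
    items: list of (a, b) discrimination/location pairs (hypothetical values)
    Large negative values flag response patterns that are unlikely given theta.
    """
    log_lik = expected = variance = 0.0
    for u, (a, b) in zip(responses, items):
        p = irf_2pl(theta, a, b)
        q = 1.0 - p
        log_lik += u * math.log(p) + (1 - u) * math.log(q)
        expected += p * math.log(p) + q * math.log(q)
        variance += p * q * math.log(p / q) ** 2
    # variance is zero only if every p equals .5; not handled in this sketch.
    return (log_lik - expected) / math.sqrt(variance)


items = [(1.0, -2.0), (1.0, -1.0), (1.0, 0.0), (1.0, 1.0), (1.0, 2.0)]
consistent = lz_index([1, 1, 1, 0, 0], theta=0.0, items=items)
aberrant = lz_index([0, 0, 1, 1, 1], theta=0.0, items=items)
```

In this toy example the pattern that affirms the easy items and rejects the hard ones gets a positive index, while the reversed pattern gets a strongly negative one. A cutoff on the index then trades detection of fakers against false positives among honest respondents.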

Hough (1998)

  • Strategy 1: “correcting” an individual’s content scale scores based on the individual’s score on an Unlikely Virtues (UV) scale
  • Strategy 2: removing people from the applicant pool because their scores on a UV scale suggest they are presenting themselves in an overly favorable way
  • Incumbent and applicant data from three large studies were used to evaluate the two strategies. The data suggest that
    • (a) neither strategy affects criterion-related validities
    • (b) both strategies produce applicant mean scores for content scales that are closer to incumbent mean scores
    • (c) men, women, Whites, and minorities are not differentially affected
    • (d) both strategies result in a subset of people who are not hired who would otherwise have been hired
  • If one’s goal is to reduce the impact of intentional distortion on hiring decisions, both strategies appear reasonably effective

Snell, Sydell, and Lueke (1999)

  • Proposed an interactional model of applicant faking based on individual differences
  • Successful faking involves:
    • Ability to fake
      • Dispositional factors: general mental ability (GMA; e.g., Jensen, 1998), emotional intelligence (EI; e.g., Mayer & Salovey, 1997)
      • Experiential factors
      • Test characteristics: item type, item format, item scoring
    • Motivation to fake
      • Demographic factors: age, gender (these are probably moderators, not predictors)
      • Dispositional factors: impression management, integrity, Machiavellianism, manipulativeness, organizational delinquency, locus of control, stage of cognitive moral development
      • Perceptual factors: others’ behavior, others’ attitudes, fairness

Viswesvaran and Ones (1999)

  • The authors examined whether individuals can fake their responses to a personality inventory if instructed to do so
  • Between-subjects and within-subject designs were meta-analyzed separately
  • Across 51 studies, fakability did not vary by personality dimension; all the Big Five factors were equally fakable
    • When instructed to fake good, participants were able to change their responses by almost half a standard deviation on average
  • Faking produced the largest distortions in social desirability scales
  • Instructions to fake good produced lower effect sizes compared with instructions to fake bad
  • Within-subjects designs produce more accurate estimates
  • Between-subjects designs may distort estimates due to Subject x Treatment interactions and low statistical power
  • An avenue for fruitful future research lies in investigating whether individual differences in fakability contribute valid variance to the criterion of interest (e.g., job performance). For example, to the extent that fakability reflects social intelligence or some form of adaptability, individual differences in fakability may contribute to explaining successful job performance, especially in some occupations such as salespersons, politicians, customer service representatives, and so forth
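A shift of "almost half a standard deviation" is a standardized mean difference. As a minimal sketch, assuming a between-subjects comparison and the usual pooled-variance form of Cohen's d (the notes do not state the exact formula the meta-analysis used), the effect size for a fake-good versus honest comparison could be computed as follows. The scores are invented.

```python
import math
from statistics import mean, variance


def cohens_d(treatment, control):
    """Standardized mean difference using the pooled sample variance."""
    n1, n2 = len(treatment), len(control)
    pooled_var = (
        (n1 - 1) * variance(treatment) + (n2 - 1) * variance(control)
    ) / (n1 + n2 - 2)
    return (mean(treatment) - mean(control)) / math.sqrt(pooled_var)


fake_good = [4, 5, 6]  # hypothetical scale scores under fake-good instructions
honest = [3, 4, 5]     # hypothetical scale scores under honest instructions
d = cohens_d(fake_good, honest)
```

A within-subjects design would instead compare the same respondents under both instructions, which removes between-person variance from the comparison.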

Zickar and Robie (1999)

  • Military recruits were instructed to complete a personality inventory under 1 of 3 conditions: answer honestly, fake good, or fake good with coaching
  • A graded response model (F. Samejima, 1969) was fit to items from 3 personality scales
  • Although there was a large difference in latent personality trait scores because of faking, there were few differences in the functioning of items across conditions
  • Results of confirmatory factor analyses suggest that faking leads to an increase in common variance that is unrelated to substantive construct variance
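For readers unfamiliar with the graded response model, the following minimal sketch computes category probabilities for one polytomous item. Each threshold defines a logistic curve for responding in that category or higher, and adjacent curves are differenced. The parameter values are hypothetical and are not estimates from the study.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    theta: latent trait level
    a: item discrimination
    thresholds: increasing list of m threshold parameters (m + 1 categories)
    """
    # Cumulative probabilities of responding in category k or higher,
    # padded with 1 (lowest category or higher) and 0 (above the highest).
    cumulative = (
        [1.0]
        + [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
        + [0.0]
    )
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]


def expected_score(theta, a, thresholds):
    """Expected category (0-based) at a given trait level."""
    probs = grm_category_probs(theta, a, thresholds)
    return sum(k * p for k, p in enumerate(probs))


probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-1.0, 0.0, 1.0])
```

Comparing fitted item parameters across honest and faking conditions is one way to ask whether items function differently, as opposed to respondents simply shifting on the latent trait.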

Jackson, Wroblewski, and Ashton (2000)

  • Evaluated the effects of faking on mean scores and correlations with self-reported counterproductive behavior of integrity-related personality items administered in single-stimulus and forced-choice formats
  • In laboratory studies, respondents instructed to respond as if applying for a job scored higher than when given standard or “straight-take” instructions
    • The size of the mean shift was nearly a full standard deviation for the single-stimulus integrity measure, but less than one third of a standard deviation for the same items presented in a forced-choice format
  • The correlation between the personality questionnaire administered in the single-stimulus condition and self-reported workplace delinquency was much lower in the job applicant condition than in the straight-take condition, whereas the same items administered in the forced-choice condition maintained their substantial correlations with workplace delinquency

Piedmont, McCrae, Riemann, and Angleitner (2000)

  • The authors evaluated the utility of several types of validity scales in a volunteer sample of 72 men and 106 women who completed the Revised NEO Personality Inventory (NEO-PI-R; P. T. Costa & R. R. McCrae, 1992) and the Multidimensional Personality Questionnaire (MPQ; A. Tellegen, 1978/1982) and were rated by 2 acquaintances on the observer form of the NEO-PI-R
  • Analyses indicated that the validity indexes lacked utility in this sample
  • A partial replication (N = 1,728) also failed to find consistent support for the use of validity scales
  • The authors illustrate the use of informant ratings in assessing protocol validity and argue that psychological assessors should limit their use of validity scales and seek instead to improve the quality of personality assessments

Ferrando and Chico (2001)

  • The present study examined whether an internal procedure for assessing the scalability of the response patterns, based on item response theory (IRT), can detect deliberate dissimulation (faking good) in the Extraversion, Neuroticism, and Psychoticism scale scores of the Eysenck Personality Questionnaire Revised
  • The procedure is compared to the traditional approaches, which use the Lie and the Social Desirability (SD) scales
  • A data set was analyzed in which participants were either administered the measures in standard conditions or given special instructions to fake good
  • The results showed that the IRT-based measures were not powerful enough to detect dissimulation, whereas the Lie and SD scales performed much better

Holden, Wood, and Tomashewski (2001)

  • Examined response time restriction as a method for reducing the influence of faking on personality scale validity
  • Across three experiments, no evidence emerged that limiting response time prevents or reduces the effect of faking on the validity of self-report personality scales
  • Results were interpreted as failing to support a simple model of personality test item response dissimulation that predicts that lying takes time
  • Findings were consistent with models implying that lying involves primitive cognitive processing or that lying may be associated with complex processing that includes both primitive responding and cognitive overrides

Stark, Chernyshenko, Chan, Lee, and Drasgow (2001)

  • Faking is operationalized as either a dispositional (trait) or a situational variable
  • The 16PF scale reliabilities were slightly lower in the applicant sample
  • The situational faking investigation of the noncognitive scales, excluding IM, indicated that numerous items displayed differential item functioning (DIF) across testing situations
  • DIF did not cancel when summing across items, so differential test functioning (DTF) resulted
  • Situational faking seems to be substantial, and the scales designed to detect it may not function as intended among job applicants

Donovan, Dwight, and Hurtz (2003)

  • This study used the randomized-response technique to estimate the base rate of entry-level job applicants faking during the application process
  • The results revealed that a substantial number of recent job applicants did report engaging in varying degrees of misrepresentation, and that the base rate for faking is strongly related to both the severity and verifiability of the deceptive behavior
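The logic of the randomized-response technique can be sketched with a forced-response variant: a private randomizing device tells each respondent either to answer truthfully or to say "yes" regardless, so no individual answer is incriminating, yet the base rate can still be recovered from the aggregate. This variant and the probability used are assumptions for illustration; the notes do not say which design the study used.

```python
import random


def estimate_prevalence(yes_rate, p_truth):
    """Recover the base rate from the observed proportion of 'yes' answers.

    Observed rate = p_truth * prevalence + (1 - p_truth) * 1,
    so prevalence = (yes_rate - (1 - p_truth)) / p_truth.
    """
    return (yes_rate - (1.0 - p_truth)) / p_truth


def simulate_yes_rate(true_prevalence, p_truth, n, seed=0):
    """Simulate n respondents under the forced-response design."""
    rng = random.Random(seed)
    yes = 0
    for _ in range(n):
        engaged_in_behavior = rng.random() < true_prevalence
        if rng.random() < p_truth:
            # Device says: answer truthfully.
            yes += 1 if engaged_in_behavior else 0
        else:
            # Device says: answer "yes" regardless of the truth.
            yes += 1
    return yes / n


observed = simulate_yes_rate(true_prevalence=0.30, p_truth=0.5, n=100_000)
estimate = estimate_prevalence(observed, p_truth=0.5)
```

The price of the added privacy is a noisier estimate, since only the truthful portion of the answers carries information.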

Dwight and Donovan (2003)

  • A warning stating that faking could be identified, together with the potential consequences of faking, affected responding

Paulhus, Harms, Bruce, and Lysy (2003)

  • Developed the over-claiming questionnaire (OCQ) and analyzed over-claiming using signal detection theory
  • The over-claiming technique provides an operationalization of self-enhancement that is both concrete and independent of cognitive ability
  • Study 1: The accuracy and bias indexes derived from OCQ knowledge ratings appear to be valid operationalizations of cognitive ability and self-enhancement, respectively
  • Study 2: Participants showed a more modest self-presentation when the possible embarrassment of claiming a nonexistent item was made salient; the validity of the over-claiming index as a measure of self-enhancement was not compromised by the warning
  • Study 3: OCQ bias, the over-claiming index, appears to be sensitive to both trait and situational sources of self-enhancement. When trying to give a positive impression, participants showed a substantially higher rate of over-claiming. At the same time, individual differences continued to play a role in predicting over-claiming: within each condition, narcissists over-claimed more than did non-narcissists. Hence, the over-claiming index remains a valid indicator of trait self-enhancement regardless of the potentially disruptive impact of demand for self-presentation
  • Study 4:
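The signal detection framing behind the over-claiming technique can be sketched as follows. Claimed familiarity with real items counts as hits, and claimed familiarity with nonexistent items (foils) counts as false alarms. The sketch below uses the standard sensitivity (d') and criterion (c) formulas with a common correction for extreme rates. Which specific indexes the authors preferred is not stated in these notes, so treat this choice as an assumption.

```python
from statistics import NormalDist


def sdt_indexes(hits, n_real, false_alarms, n_foils):
    """Return (d_prime, criterion) from counts of claimed real items and foils.

    Adds 0.5 to counts and 1 to totals so that rates of exactly 0 or 1
    do not produce infinite z-scores.
    """
    hit_rate = (hits + 0.5) / (n_real + 1)
    fa_rate = (false_alarms + 0.5) / (n_foils + 1)
    z = NormalDist().inv_cdf
    d_prime = z(hit_rate) - z(fa_rate)            # accuracy: real vs. foil discrimination
    criterion = -(z(hit_rate) + z(fa_rate)) / 2   # bias: lower means more claiming overall
    return d_prime, criterion


# Hypothetical respondent who claims nearly everything, foils included.
d_prime, criterion = sdt_indexes(hits=24, n_real=24, false_alarms=6, n_foils=6)
```

In this framing, accuracy reflects actual knowledge, while a low (liberal) criterion reflects a tendency to claim familiarity regardless of whether the item exists.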

Zickar, Gibby, and Robie (2004)

  • Mixed-model item response theory was used to identify subgroups within samples of individuals taking two different personality inventories under various conditions
  • Across the applicant and incumbent data sets, the authors generally found that three classes were needed to model all response patterns
  • Results demonstrate that previous assumptions about the nature of faking on personality inventories have been too restrictive

Birkeland, Manson, and Kisamore (2006)

  • This study meta-analytically investigated the extent to which job applicants fake their responses on personality tests
  • Across all job types, applicants scored significantly higher than non-applicants on extraversion (d = .11), emotional stability (d = .44), conscientiousness (d = .45), and openness (d = .13)
  • For certain jobs (e.g., sales), however, the rank ordering of mean differences changed substantially, suggesting that job applicants distort responses on personality dimensions that are viewed as particularly job relevant
  • Smaller mean differences were found in this study than those reported by Viswesvaran and Ones (1999; Educational and Psychological Measurement, 59(2), 197–210), who compared scores for induced “fake-good” vs. honest response conditions
  • Direct Big Five measures produced substantially larger differences than did indirect Big Five measures

McFarland and Ryan (2006)

  • Tested a model that integrated the theory of planned behavior (TPB) with a model of faking presented by McFarland and Ryan (2000) to predict faking on a personality test
  • In Study 1, the TPB explained sizable variance in the intention to fake
  • In Study 2, the TPB explained both the intention to fake and actual faking behavior
  • Study 2 did not find evidence for the two proposed moderators: valence toward performing well on the test and a warning about a social desirability scale

Mueller-Hanson, Heggestad, and Thornton (2006)

  • Proposed and tested a model of psychological processes underlying faking
  • Personality factors and perceptions of situational factors contribute to faking behavior
    • Perceptions of the situation (belief in the importance of faking, one’s perceived behavioral control, and one’s beliefs about subjective norms) and Conscientiousness and Emotional Stability were related to intentions to fake which, in turn, were related to faking behavior
    • Ability to fake was not related to intentions to fake, and willingness to fake had an unexpected negative relationship to intentions to fake
  • The implications of these findings are
    • (a) people differ with regard to how much they will fake on a personality test in a simulated employment setting with some people faking substantially and others faking very little or not at all
    • (b) the extent to which an individual fakes is partially determined by the person’s attitudes and personality characteristics.

