Research Article

Consistent Kindness: Money Allocation and Kind Act Decisions Are Regulated by a ‘Welfare Trade-Off Ratio’

Oliver Scott Curry1,2 , Chloe San Miguel2 , James Wilkinson3 , Mehmet Necip Tunç4

Social Psychological Bulletin, 2026, Vol. 21, Article e14583, https://doi.org/10.32872/spb.14583

Received: 2024-05-07. Accepted: 2025-04-13. Published (VoR): 2026-04-09.

Handling Editor: Gabriela Czarnek, Jagiellonian University, Krakow, Poland

Corresponding Author: Oliver Scott Curry, School of Anthropology and Museum Ethnography, 51/53 Banbury Road, Oxford, OX2 6PE, United Kingdom. E-mail: oliver.curry@anthro.ox.ac.uk

Supplementary Materials: Code, Data, Materials, Preregistration [see Index of Supplementary Materials]

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License, CC BY 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Is kindness regulated by a cost-benefit ratio? Previous research suggests that money allocation decisions are regulated by a ‘welfare trade-off ratio’ (WTR) that reflects the weight attached to the actor’s welfare relative to the recipient’s welfare. Here we replicate this research, and extend it by creating a new measure—The Kindness Questionnaire—which asks which real-world acts of kindness, previously rated for cost and benefit, participants would perform for others. In Study 1 (n = 6,601) money allocation (MA) and Kindness Questionnaire (KQ) decisions for family, friends, colleagues and strangers were highly consistent with an underlying WTR (~92%); more consistent than would be expected by chance; and generally more consistent than with cost or benefit alone. WTRs were high (~0.81); and, for money allocation, declined with social distance. In Study 2 (n = 8,492) MA and KQ decisions for neighbors were highly consistent with an underlying WTR (~89%); more consistent than would be expected by chance; and generally more consistent than with cost or benefit alone. WTRs were high (~0.75). In both studies, The Kindness Questionnaires showed good convergent, divergent and incremental validity. These studies corroborate ‘welfare tradeoff ratio’ theory, establish proof of principle for a new way of measuring kindness, and provide new tools for measuring kindness to colleagues, strangers and neighbors.

Keywords: kindness, welfare trade-off ratio, prosocial behavior, scale development

Highlights

  • Previous research suggests that monetary allocation decisions are guided by a psychological cost-benefit variable called a welfare trade-off ratio (WTR).

  • This study introduces and tests a new tool, the Kindness Questionnaire (KQ), to measure WTRs for real-world kind acts.

  • In two large U.S. samples, decisions about performing kind acts were highly consistent with an underlying WTR.

  • These findings suggest that the same psychological logic applies to both monetary and real-world kind decisions, offering a new way to understand and assess kindness.

Kindness is typically understood as actions intended to benefit others, at some cost to the actor—an ‘ABC’ model of kindness (Curry et al., 2018). Whereas it was once a problem to explain why anyone would pay a cost to benefit others, various theories now exist (kin altruism, mutualism, reciprocal altruism, competitive altruism) that explain why we are kind to family, friends, colleagues and strangers (Curry et al., 2018).

Previous research has suggested that people’s decisions about whether to be kind to others do not depend solely on the cost incurred (‘help if cost is below a certain level’), or the benefit provided (‘help if benefit is above a certain level’), but rather on the ratio of the cost to benefit (‘help if the ratio of cost to benefit is below a certain level’). This ratio represents the point at which individuals are indifferent between cost to self and benefit to others, and it is used to make decisions about kind acts. For example, a person who is willing to help another up to a cost-benefit ratio of 0.50 would pay a cost of $5 (or less) to provide a benefit of $10 (or more), and would also pay a cost of $15 (or less) to provide a benefit of $30 (or more). Individuals differ in the cost-benefit ratios they employ: a ‘kinder’ person is willing to incur a greater cost to provide a given level of benefit. For example, a person who is willing to help another up to a cost-benefit ratio of 0.75 would pay a cost of $7.50 (or less) to provide a benefit of $10 (or more), and so on. The ratio can be interpreted as the weight an actor attaches to their own welfare relative to the recipient’s welfare, and for that reason has been referred to as a ‘welfare trade-off ratio’ (WTR) (Delton et al., 2023; Delton & Robertson, 2016).
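
The decision rule described above can be made concrete with a short sketch (illustrative only; the function name and dollar examples are ours, not the authors' implementation):

```python
def would_help(cost: float, benefit: float, wtr: float) -> bool:
    """Perform the kind act whenever the cost-benefit ratio does not
    exceed the actor's welfare trade-off ratio (WTR)."""
    return cost / benefit <= wtr

# A WTR of 0.50 accepts $5-for-$10 and $15-for-$30, but not $7.50-for-$10;
# a 'kinder' WTR of 0.75 accepts $7.50-for-$10 as well.
would_help(5, 10, 0.50)     # True  (ratio 0.50)
would_help(15, 30, 0.50)    # True  (ratio 0.50)
would_help(7.50, 10, 0.50)  # False (ratio 0.75)
would_help(7.50, 10, 0.75)  # True  (ratio 0.75)
```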

Empirical evidence in support of such WTRs comes mostly from money-allocation tasks, in which it has been found that decisions are highly consistent with an underlying WTR, and that the value of this ratio declines with social distance. For example, in one study with US students (n = 167), decisions were 97% consistent with an underlying WTR; and the ratios were 0.62 for friends, and 0.34 for acquaintances (Delton et al., 2023; Delton & Robertson, 2016). Another study with MTurkers (n = 3,864) found median WTRs of 1.07 for mothers and fathers, 0.93 for romantic partners, 0.80 for siblings and friends, 0.53 for acquaintances, and 0.27 for strangers (Forster et al., 2017). A further study with US students found WTRs of 0.26 to strangers in ‘low need’, and 0.60 to strangers in 'high need’ (Sznycer et al., 2019).

Here we replicate this money allocation research with large samples of the US general public. And we extend it by creating and testing a new measure, The Kindness Questionnaire (KQ). The KQ asks people whether they would be willing to perform a series of kind acts previously rated for cost and benefit; this enables us to investigate whether decisions about performing real-world acts of kindness are similarly regulated by a welfare trade-off ratio, and to estimate the value of that ratio. We also investigate the convergent, divergent and incremental validity of The Kindness Questionnaire.

Study 1

Do people make decisions about allocating money and performing real-world acts of kindness that are consistent with an underlying WTR? What is the value of this WTR, and does it decline with social distance across family, friends, colleagues and strangers? And do The Kindness Questionnaires demonstrate good convergent, divergent and incremental validity?

Method

We designed a survey that asked participants to complete standard money allocation tasks (MAs), and Kindness Questionnaires (KQs), to family, friends, colleagues and strangers, and to complete the Multidimensional Measure of Prosocial Behavior (MMPB) (Nielson et al., 2017). Note that all money allocation and kind act decisions were hypothetical, and so technically should be referred to as ‘hypothetical money allocations’ (HMA) and ‘hypothetical kind acts’ (HKA); but for ease of exposition we refer to them simply as ‘money allocation’ (MA) and ‘kind acts’ (KA) throughout. All materials, data and analysis are available on OSF (see Curry et al., 2022).

The MA measures consisted of 15 items, with cost-benefit ratios running from 0.10 to 1.50 (cost-benefit ratios were calculated by dividing the opportunity cost to the participant by the benefit to the recipient; all MA items, and their ratios, are shown in Table S1a). Participants were asked to name a specific family member, friend, and colleague, and to think of a typical stranger. Then, for each of the items, they were asked whether they would prefer an amount for themselves, or an amount for another person. For example: “Choose the option that you prefer: $11.77 for YOU or $13.07 for [family member’s name].”

We created The Kindness Questionnaires (KQs) by choosing a selection of kind acts suitable for family, friends, colleagues and strangers from a large pool of items previously rated for perceived cost and benefit (Curry et al., in press). In that paper we investigated the relationship between the benefit, cost, cost-benefit ratio and kindness of a large sample of real-world kind acts, drawn from a variety of popular and professional sources. In Study 1 of that paper, participants (n = 15,997) rated 1,692 acts to family, friends, colleagues and strangers for cost, benefit, and kindness on a 1 (“Not at all [beneficial / costly / kind]”) to 9 (“Extremely [beneficial / costly / kind]”) scale. In Study 2 of that paper, participants (n = 4,801) rated 385 acts to a generic ‘someone’. Cost-benefit ratios for the kind acts were calculated by dividing the average cost rating for an item by its average benefit rating. In deriving ratios from Likert data, we assume that costs and benefits have a true zero, and that the intervals for the kind act costs and benefits (used to create the KQ ratios) are equal, just as they are for the monetary costs and benefits (used to create the MA task).

Different items were chosen from this item pool for different recipients. We would have preferred equal numbers of items, but we were constrained in our choice by: a) the number of items in the pool; b) their suitability for each recipient (for example, ‘make breakfast in bed’ might be suitable for a family member, but not a colleague); and c) the range and distribution of the items’ cost-benefit ratios. The KQ for Family had 23 items, with cost-benefit ratios ranging from 0.21–0.69. The KQ for Friends had 21 items, with cost-benefit ratios ranging from 0.21–0.80.1 The KQ for Colleagues had 22 items, with cost-benefit ratios ranging from 0.23–0.94. The KQ for Strangers had 19 items, with cost-benefit ratios ranging from 0.25–1.07. All items, and cost-benefit ratios, are shown in Tables S1b–e. Participants were asked to name a specific family member, friend, and colleague, and to think of a typical stranger. They were then asked whether they would perform each of the kind acts for each recipient. For example: “Given the opportunity, would you: ‘Help carry heavy bags for [family member’s name]’ – yes or no?” (the average cost rating for this item is 1.99, the average benefit is 7.77, hence the cost-benefit ratio is 0.26).
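
The item-level ratios above come from a simple division of the mean ratings; a minimal sketch (ours, shown only to make the computation explicit):

```python
def cost_benefit_ratio(mean_cost: float, mean_benefit: float) -> float:
    """Item ratio: mean cost rating divided by mean benefit rating (1-9 scales)."""
    return mean_cost / mean_benefit

# 'Help carry heavy bags': mean cost 1.99, mean benefit 7.77.
round(cost_benefit_ratio(1.99, 7.77), 2)  # 0.26
```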

MAs and KQs were scored using the Kirby method (Kirby, 2000). This method was first developed to measure time discounting, but has subsequently been widely used to measure prosocial decision-making (Delton et al., 2023; Delton & Robertson, 2016; Forster et al., 2017; Sznycer et al., 2019). This method calculates the consistency of responses with each possible welfare trade-off ratio, and reports the maximum consistency, and the ratio with which responses were maximally consistent. For example, if a participant was willing to perform all acts below a ratio of 0.50, and no acts above that ratio, they would be 100% consistent with a ratio of 0.50. Alternatively, if a participant was willing to perform most acts below a ratio of 0.50, and few acts above that ratio, they might be only 90% consistent with a ratio of 0.50. In cases where responses are equally maximally consistent with two or more ratios—for example, 80% consistent with ratios of 0.50 and 0.70—the method reports the geometric mean of those ratios (for example, 80% consistent with a ratio of 0.59). The R-code used to analyze the data using this method is available on OSF (see Curry et al., 2022).
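
The scoring logic just described can be sketched as follows (a minimal Python illustration under our own assumptions, not the authors' R code on OSF; we take the candidate WTRs to be the item ratios themselves, and count a response as consistent if the participant accepted items at or below the candidate ratio and declined items above it):

```python
from math import prod

def kirby_score(item_ratios, choices):
    """Return (max consistency, best-fitting WTR) for one participant.

    item_ratios: cost-benefit ratio of each item
    choices:     0/1 decision per item (1 = chose the kind option)
    Ties among equally consistent candidate WTRs are resolved by their
    geometric mean, as described in the text.
    """
    by_hits = {}
    for w in sorted(set(item_ratios)):
        # A choice is consistent with WTR w if the participant said 'yes'
        # exactly when the item's ratio is at or below w.
        hits = sum((r <= w) == bool(c) for r, c in zip(item_ratios, choices))
        by_hits.setdefault(hits, []).append(w)
    top = max(by_hits)
    tied = by_hits[top]
    wtr = prod(tied) ** (1 / len(tied))  # geometric mean of tied candidates
    return top / len(item_ratios), wtr

# Perfectly threshold-like responding: 100% consistent with a WTR of 0.40.
kirby_score([0.2, 0.4, 0.6, 0.8], [1, 1, 0, 0])  # (1.0, 0.4)
```

For a tied case like the one in the text, `kirby_score([0.5, 0.5, 0.7, 0.7], [1, 0, 1, 0])` is equally (50%) consistent with ratios of 0.50 and 0.70, and so reports their geometric mean, ≈ 0.59.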

Participants then completed a general measure of prosociality, the Multidimensional Measure of Prosocial Behaviour (Nielson et al., 2017). A sample item reads: “If I see someone being given a hard time, I stand up for that person” (1 = not like me at all, 5 = very much like me). Participants completed the KQs, then the MAs, then the MMPB. The order of recipients in the KQs and MAs was randomized, as was the order of items in all scales.

Participants also completed: one-item measures of religiosity, and political identification; standard demographics (age, sex, ethnicity, education, income) and location (US state); and a series of attention checks.

The study was approved by the Committee on the Use of Human Subjects in Research at Harvard University (IRB19-0070). The survey was hosted on Qualtrics.com, and participants were recruited by PureProfile.com at a cost of £2.60 per participant. Our funding enabled us to aim for samples of approximately 100 from each of the 50 US states (n ≈ 5,000). Data were collected from April 8 to May 14, 2021.

Results

A total of 6,601 people completed the survey (age: M = 52 years, SD = 18; 56% female, 43% male, 1% other).2

Descriptive statistics for each recipient-specific MA and KQ are shown in Table 1. MA WTRs tended to be bi- or tri-modal, including peaks at the minimum and maximum values. KQ WTRs were highly skewed, especially for family and friends, where approximately half of the scores were at ceiling. Descriptive statistics (mean, standard deviation) for each MA and KQ item are displayed in Tables S1a–e.

Table 1

MA and KQ Descriptives (Study 1)

                          Consistency          WTR
Measure   Recipient       Mean      SD         Mean    SD
MA        Family          93%       10%        1.08    0.44
          Friend          92%       11%        0.94    0.47
          Colleague       91%       11%        0.86    0.49
          Stranger        91%       11%        0.73    0.53
KQ        Family          93%       9%         0.67    0.08
          Friend          95%       8%         0.73    0.13
          Colleague       94%       7%         0.81    0.17
          Stranger        88%       9%         0.66    0.23

Consistencies

MA (91–93%) and KQ (88–95%) choices were highly consistent. A repeated-measures ANOVA showed that the main effect of Method was significant, F(1, 6600) = 40.39, p < .001, indicating a difference in consistency between MA and KQ across recipient types. The main effect of Recipient was also significant, F(3, 19800) = 701.14, p < .001, indicating a difference in consistency between recipient groups across the methods. There was a significant interaction between Method and Recipient, F(3, 19800) = 537.77, p < .001. Post-hoc tests using the Bonferroni correction showed that the consistencies of MA and KQ decisions for family were not significantly different (mean difference = -0.1%, p = 1.0), whereas the MA and KQ decisions for friends, colleagues and strangers were significantly different from one another (ps < .001, -3% ≤ mean differences ≤ 3%, -0.24 ≤ ds ≤ 0.23), although the effect sizes were small (Table S3 in file KQ_S1_data.omv; Figure 1).

Figure 1

MA and KQ Consistencies Compared

Note. Error bars depict 95% confidence intervals.

One sample t-tests showed that the consistency of all measures was significantly (ps < 0.001) and substantially (ds > 2.09) greater than would be expected by chance (68%) (Table S2 in file KQ_S1_data.omv).3 Because consistency scores might be inflated by extreme responses (at floor, or at ceiling) we re-ran these analyses after filtering out participants with minimum or maximum WTRs on any measure. Consistencies in this sample (n = 530) remained high (MA: 90–91%; KQ: 84–91%), and all were still significantly (ps < 0.001) and substantially (ds > 1.56) greater than would be expected by chance.

We also investigated whether MA and KQ choices were more consistent with a cost-benefit ratio (WTR) than with cost or benefit alone (again, calculated using the Kirby method). A repeated-measures ANOVA showed that the main effect of Method was significant, F(1, 6600) = 2026.40, p < .001, indicating a difference between MA and KQ across recipient type and measure (cost alone, benefit alone, or cost-benefit ratio). The main effect of Recipient was significant, F(3, 19800) = 1064.92, p < .001, indicating a difference between recipient types across the two methods and three measures. The main effect of Measure was also significant, F(2, 13200) = 7854.34, p < .001, indicating a difference between the cost, benefit, and cost-benefit ratio measures across recipient types and methods.

Post-hoc tests using the Bonferroni correction showed that MA choices for all recipients were significantly (ps < .001) and substantially more consistent with WTR than with cost (10%≤ mean differences ≤ 15%, 0.69 ≤ ds ≤ 0.87) or benefit (7% ≤ mean differences ≤ 10%, 0.55 ≤ ds ≤ 0.76) alone. KQ choices for family, colleagues and strangers were significantly (ps < .001) more consistent with WTR than with cost (0.1% ≤ mean differences ≤ 3%, 0.11 ≤ ds ≤ 0.64) alone; KQ choices for friends were not significantly more consistent with WTR than with cost (mean difference = 0.004%, p = 1.0). KQ choices for all recipients were significantly (ps < .001) more consistent with WTR than with benefit (1% ≤ mean differences ≤ 13%, 0.27 ≤ ds ≤ 1.18) alone (Figure 2; Table S4 in file KQ_S1_data.omv).

Figure 2

MA and KQ Consistencies Compared (WTR, Cost, Benefit)

Note. Error bars depict 95% confidence intervals.

Figure 3

MA and KQ WTRs Compared (With Observed Values)

Note. White circles depict mean welfare trade-off ratios, blue and orange marks depict observed values. Error bars depict 95% confidence intervals.

WTRs

WTRs for MA (0.73–1.08) and KQ (0.66–0.81) were high. A repeated-measures ANOVA showed that the main effect of Method was significant, F(1, 6600) = 1746.29, p < .001, indicating a difference between MA and KQ in overall WTR scores. The main effect of Recipient was also significant, F(3, 19800) = 1034.80, p < .001, indicating a difference in WTR between recipient groups. The interaction between Method and Recipient was also significant, F(3, 19800) = 1700.58, p < .001. Post-hoc tests revealed that MA WTRs were significantly higher than KQ WTRs, with a mean difference of 0.19, and that MA WTRs declined significantly with greater social distance, in the expected direction (family > friend > colleague > stranger). KQ WTRs did not decline with social distance (family < friend < colleague > stranger), presumably because family and friend scores were at ceiling (Table S5 in file KQ_S1_data.omv; Figure 3).

To test the assumption that kind act ratings had equal (as opposed to logarithmic) intervals, we compared the results of the current untransformed KQs, and those from KQs based on transformed (exponentiated using base 2) cost and benefit data, with the MAs. In all four cases (family, friend, colleague, stranger), the original untransformed KQs were significantly and substantially closer to the corresponding MAs than were the transformed KQs (see Table S9). Exponentiating with a higher base results in even more distant values. This suggests that it is reasonable to assume that the intervals of the kind act cost and benefit ratings used to create the KQs are similar to those of the MAs.
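
The transform used in this robustness check can be illustrated with a small sketch (our own, and it assumes the exponentiation is applied to the mean ratings before taking the ratio):

```python
def transformed_ratio(mean_cost: float, mean_benefit: float, base: float = 2.0) -> float:
    """Item ratio after exponentiating the Likert ratings with the given base."""
    return (base ** mean_cost) / (base ** mean_benefit)

# 'Help carry heavy bags' (Study 1 ratings 1.99 and 7.77):
# untransformed ratio ~0.26; base-2 transformed ratio ~0.018,
# illustrating how the transform pushes KQ ratios far below the MA range.
transformed_ratio(1.99, 7.77)
```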

Convergent and Divergent Validity

To test the convergent and divergent validity of the KQ, we investigated whether each recipient-specific KQ WTR predicted the corresponding MA WTR more than the MA WTRs to other recipients—for example, whether KQ Family WTR predicts MA Family WTR more than MA Friend WTR. Pearson correlations showed that each KQ WTR correlates positively with the corresponding MA WTR, and does so significantly more than with the other MA WTRs (Table 2). Correlation comparisons using ‘cocor’ (http://comparingcorrelations.org/) showed that all correlations were significantly different from one another (Steiger’s zs > 7.76, ps < .001; see Table S6 for specific correlation comparisons). A series of multiple regressions also showed that each KQ was a unique, and the best, predictor of the corresponding MA, even when controlling for the other KQs (Tables S7a-d in file KQ_S1_data.omv).

Table 2

Convergent and Divergent Validity of KQ WTRs (Pearson Correlations)

                          KQ
MA            Family    Friend    Colleague    Stranger
Family        0.26      0.20      0.19         0.19
Friend        0.16      0.29      0.24         0.27
Colleague     0.12      0.21      0.36         0.28
Stranger      0.12      0.20      0.24         0.37

Note. Correlations below the diagonal (for example, MA Friend and KQ Family) involve different measure pairings than correlations above the diagonal (for example, KQ Friend and MA Family), so values differ. All ps < .001; largest r in each column and row bolded.

Incremental Validity

To test the incremental predictive validity of the KQ, we investigated the relationship between MAs, KQs and the average Multidimensional Measure of Prosocial Behaviour (MMPB) score (α = 0.93).

Pearson correlations showed that each recipient-specific KQ WTR (family: r = 0.28; friend: 0.35; colleague: 0.37; stranger: 0.45; ps < .001) predicted MMPB more strongly than the corresponding MA WTR did (family: r = 0.24; friend: 0.30; colleague: 0.29; stranger: 0.31; ps < .001). Indeed, correlation comparisons using ‘cocor’ showed that each KQ was significantly more correlated with MMPB than the corresponding MA WTR (Steiger’s zs > 2.79, ps < .006; see Table S9 of the supplementary materials). Furthermore, multiple regressions showed that whereas the MAs alone explained 13% of the variance in MMPB (Model 1), the KQs explained an additional 17% (Model 2), and each recipient-specific KQ WTR made a significant, and larger, contribution to MMPB than the corresponding MA WTR, and indeed than every recipient-specific MA WTR (Table 3).

Table 3

A Multiple Regression to Test the Incremental Validity of KQ Over MA (MMPB)

                  Model 1                        Model 2
Predictor         B     SE    p       β          B     SE    p       β
MA  Family        0.11  0.02  < .001  0.07       0.07  0.02  < .001  0.04
    Friend        0.16  0.02  < .001  0.12       0.09  0.02  < .001  0.07
    Colleague     0.11  0.02  < .001  0.08       0.01  0.02  .60     0.01
    Stranger      0.21  0.02  < .001  0.17       0.10  0.02  < .001  0.08
KQ  Family                                       0.70  0.10  < .001  0.09
    Friend                                       0.65  0.06  < .001  0.12
    Colleague                                    0.58  0.05  < .001  0.15
    Stranger                                     0.79  0.03  < .001  0.28
                  R² = 0.13                      R² = 0.30
                                                 ΔR² = 0.18, p < .001

Discussion

People’s decisions about how to allocate money, and whether to perform a kind act, are consistent with an underlying cost-benefit or ‘welfare tradeoff ratio’. This replicates on a larger scale the results of previous research using money-allocation tasks. And it extends and corroborates previous findings with a new method, The Kindness Questionnaire. These results suggest that people employ a welfare trade-off ratio not only when it comes to explicit and precise monetary costs and benefits, but also when it comes to the implicit and imprecise costs and benefits of real-world kind acts.

The study also extends previous research by showing that, as the theory predicts, MA and to a lesser extent KQ choices are more consistent with WTR than cost or benefit alone. This was the case for fifteen of the sixteen predictions that we tested (2 methods x 4 recipients x 2 comparisons). In other words, people do not make kind decisions based on cost or benefit alone, but rather on the ratio of the cost to benefit.

However, KQ choices were more consistent with cost than MA choices were; and—the sole prediction that was not supported—KQ choices for friends were not significantly more consistent with WTR than with cost. This may be because there is something different about monetary and kind act choices, or something different about kind act choices for friends; or it could be a methodological issue arising from the high correlation between cost and cost-benefit ratio in this sample of kind acts. Further research will be needed to test these possibilities.

The study found that the US public places a much higher value on the welfare of others (MA: ~0.90; KQ: ~0.72) than previous research with US college students (MA: ~0.48) (Delton et al., 2023), but on a par with research using larger MTurk samples (MA: ~0.78) (Forster et al., 2017), and comparable to previous research looking specifically at WTRs to strangers in need (0.60) (Sznycer et al., 2019).

The study also found that, as expected, for monetary choices, WTR declines with greater social distance. People allocated more money to family, than to friends, colleagues and strangers. This was not the case for kind act choices, presumably because the range of possible scores for KQs, especially to family and friends, was too low, and hence responses were at ceiling. Nevertheless, the decline in KQ and MA WTRs for colleagues and strangers was more or less identical.

Finally, the study found that the KQs showed good convergent and divergent validity with the MA tasks, and good incremental validity with a general measure of prosociality (MMPB). However, given that the KQ measures were at ceiling for family and friends, this version of The Kindness Questionnaire should be used to measure kindness to colleagues and strangers only.

In summary, the results of Study 1 show that people make consistent decisions about kindness, the level of kindness is high, and The Kindness Questionnaires provide a promising new way to measure kindness.

Study 2

Do people make decisions about allocating money and performing real-world acts of kindness for neighbors that are consistent with a cost-benefit ratio? What is the value of this ratio? And does The Kindness Questionnaire demonstrate good convergent, divergent and incremental validity? Study 2 provides a conceptual replication of Study 1, and tests its methodological generalizability (with new measures and targets), with a larger sample.

Method

We designed a survey that asked participants to complete a money allocation task (MA), and The Kindness Questionnaire (KQ), to neighbors (participants were not given any instructions about who constitutes a neighbor—so recipients may have ranged from 'next door neighbor' to 'someone in your neighborhood'), and to complete the Prosocialness Scale for Adults (PSA) (Caprara et al., 2005). All materials, data and pre-registered analysis are available on OSF (see Curry et al., 2022).

The MA measure was the same as in Study 1. All items, and cost-benefit ratios, are shown in Table S8a.

As before, we created The Kindness Questionnaire for neighbors by selecting suitable kind acts from previous research (Study 2, Curry et al., in press), although this time we made a point of including items with a larger range of cost-benefit ratios than in Study 1. The KQ for neighbors had 22 items, with cost-benefit ratios ranging from 0.20–1.06. All items, and cost-benefit ratios, are shown in Table S8b. Participants were asked how likely it was that they would perform each of the kind acts for the recipient. For example: “Given the opportunity, would you: ‘Leave a note after damaging a neighbor's car’?” (1–4; Very unlikely, Unlikely, Likely, Very likely). The average cost rating for this item is 4.48, the average benefit is 7.89, hence the cost-benefit ratio is 0.57. We changed the response scale from ‘yes/no’ in Study 1 to the 4-point measure here in order to: a) test and ensure generalizability across measures (after Yarkoni, 2022); and b) give us the option of analyzing the data as a normal Likert scale at a later date. However, for this study, the responses were dichotomized (1 or 2 = 0 [‘No’]; 3 or 4 = 1 [‘Yes’]) so that they could still be scored using the Kirby method, along with the MA responses (Kirby, 2000).
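
The dichotomization rule is a one-line mapping (our own sketch of the recoding just described):

```python
def dichotomize(response: int) -> int:
    """Map the 1-4 likelihood scale to 0/1 for Kirby scoring:
    1 (Very unlikely) or 2 (Unlikely) -> 0 ('No');
    3 (Likely) or 4 (Very likely)     -> 1 ('Yes')."""
    if response not in (1, 2, 3, 4):
        raise ValueError("response must be an integer from 1 to 4")
    return int(response >= 3)

[dichotomize(r) for r in (1, 2, 3, 4)]  # [0, 0, 1, 1]
```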

Participants then completed: a 1-item measure of general happiness, ‘How happy are you, in general?’ (1–5) (Dunn et al., 2008); the Prosocialness Scale for Adults (PSA; Caprara et al., 2005; sample item: “I am available for volunteer activities to help those who are in need”, rated 1–5: never/almost never true, occasionally true, sometimes true, often true, almost always/always true); a 6-item interdependence (with neighbors) scale (Ayers et al., 2023); a 1-item measure of social identification with one’s neighborhood (Postmes et al., 2013); and a 1-item pictorial measure of sense of inclusion of community in self (Mashek et al., 2007). Participants completed the KQ, then the MA, then the PSA. The order of items in all scales was randomized.

Participants also completed: a range of demographic questions, including age, gender, income, education, marital status, number of children, state of residence, rural/suburban/urban region, duration of residence, political identity, religion and religiosity, ethnicity, and similarity to neighbors (Parker et al., 2018); and a series of attention checks.4

The study was approved by the Committee on the Use of Human Subjects in Research at Harvard University (IRB19-0070). The survey was hosted on—and participants were recruited by—SurveyMonkey.com. Participants were compensated $0.60 for completing the survey. Our funding enabled us to aim for samples of approximately 200 from each of the 50 US states (n ≈ 10,000). Data were collected in the week of 26 September 2022.

Results

A total of 8,492 people completed the survey (age: median age group 45–54y; 54% female, 45% male, 1% other).5

The mean consistency for the Money Allocation (MA) measure was 91% (SD = 11%). For the Kindness Questionnaire (KQ) measure, the mean consistency was 87% (SD = 8%). The mean welfare trade-off ratio (WTR) for MA was 0.76 (SD = 0.49) and the mean WTR for the KQ was 0.74 (SD = 0.25). MA WTRs were tri-modal, including peaks at the minimum and maximum values. KQ WTRs were highly skewed, with approximately one third of the scores at ceiling. Descriptive statistics (mean, standard deviation) for each MA and KQ item are displayed in Tables S8a–b.

Consistencies

MA (91%) and KQ (87%) choices were highly consistent. A paired sample t-test showed that the MA consistency (M = 91%, SD = 11%) was significantly higher than the KQ consistency (M = 87%, SD = 11%), t(8491) = -31.54, p < .001, d = -0.34 (Table S12 in file KQ_S2_data.omv). One sample t-tests showed that the consistency of both the MA (t(8491) = 195.29, p < .001, d = 2.12) and the KQ (t(8491) = 217.59, p < .001, d = 2.36) was significantly and substantially greater than would be expected by chance (68%) (Table S9 in file KQ_S2_data.omv). Again, because consistency scores might be inflated by extreme WTRs (at floor, or at ceiling), we re-ran these analyses after filtering out participants with minimum or maximum scores on any measure. Consistencies in this sample (n = 4,047) remained high (MA: 90%; KQ: 85%), and both were still significantly (ps < .001) and substantially (ds > 2.13) greater than would be expected by chance.

Again, we investigated whether MA and KQ choices were more consistent with a cost-benefit ratio (WTR) than they were with cost or benefit alone. A repeated-measures ANOVA showed that the main effect of Method (MA and KQ) was not significant, F(1,8491) = 0.62, p = .431. However, the main effect of Measure was significant, F(2, 16982) = 7401.01, p < .001. And the interaction between Measure and Method was significant, F(2, 16982) = 5075.53, p < .001. Post-hoc tests using the Bonferroni correction showed that MA choices were significantly (ps < .001) and substantially more consistent with WTR than with cost (mean difference = 14%, d = 1.44) or benefit (mean difference = 9%, d = 1.08) alone. KQ choices were significantly (ps < .001) and substantially more consistent with WTR than with benefit (mean difference = 10%, d = 1.57) alone; but significantly (p < .001) less consistent with WTR than with cost (mean difference = -0.1%, d = -0.12) alone, although the effect size was small (Figure 4; Table S14).

Figure 4

MA and KQ Consistencies Compared (WTR, Cost, Benefit)

Note. Error bars depict 95% confidence intervals.

WTRs

WTRs for MA (0.76) and KQ (0.74) were high. A paired sample t-test showed that the MA WTR (M = 0.76, SD = 0.49) was significantly higher than the KQ WTR (M = 0.74, SD = 0.25), t(8491) = -4.76, p < .001, d = -0.05, although the effect size was very small (Table S12).

Again, the original untransformed KQ was much closer to the MA than the transformed KQ was (based on exponentiated cost and benefit data; see Table S10).

Convergent and Divergent Validity

To test convergent and divergent validity, we investigated whether the KQ WTR predicted the MA WTR, and whether it did so more than it predicted some other positive personal quality: general happiness. Pearson correlations showed that the KQ WTR correlates positively with the MA WTR (r = 0.33; p < .001), and does so more than it correlates with general happiness (r = 0.20; p < .001; Table S13 in file KQ_S2_data.omv). Again, correlation comparisons using ‘cocor’ showed that the correlation between KQ to Neighbors and PSA was significantly higher than the correlation between KQ to Neighbors and general happiness (Steiger’s z = 9.65, p < .001).

Incremental Validity

To test the incremental predictive validity of the KQ, we investigated the relationship between MA, KQ and the total PSA score (α = 0.94).

A multiple regression showed that whereas the MA WTR alone explained 8% of the variance in PSA (Model 1), the KQ WTR explained an additional 14% (Model 2), and the KQ WTR (β = 0.40) made a significant, and larger, contribution to PSA than the MA WTR did (β = 0.15; Table 4; see Note 6).

Table 4

A Multiple Regression to Test the Incremental Validity of KQ Over MA (PSA)

Predictor        Model 1                           Model 2
                 B      SE     p         β         B      SE     p         β
MA (Neighbor)    0.49   0.02   < .001    0.29      0.26   0.02   < .001    0.15
KQ (Neighbor)    —      —      —         —         1.37   0.03   < .001    0.40

Model 1: R² = 0.08. Model 2: R² = 0.22. ΔR² = 0.14, p < .001.
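The incremental-validity logic of Table 4 (fit a baseline model with the MA WTR, add the KQ WTR, and examine the change in R²) can be sketched as follows. The simulated data and effect sizes are illustrative assumptions only, used to show the computation rather than to reproduce the reported coefficients:

```python
import numpy as np

def r_squared(X, y):
    """OLS R^2 with an intercept term."""
    X1 = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ beta
    return 1.0 - resid.var() / y.var()

rng = np.random.default_rng(0)
n = 1000
ma = rng.standard_normal(n)                  # hypothetical MA WTR scores
kq = 0.3 * ma + rng.standard_normal(n)       # KQ WTR, correlated with MA
psa = 0.15 * ma + 0.40 * kq + rng.standard_normal(n)  # outcome (PSA)

r2_model1 = r_squared(ma[:, None], psa)                 # MA only
r2_model2 = r_squared(np.column_stack([ma, kq]), psa)   # MA + KQ
delta_r2 = r2_model2 - r2_model1                        # incremental R^2
```

Because OLS R² can only increase when a predictor is added, the test of incremental validity is whether ΔR² is significantly greater than zero, as reported in Table 4.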

Discussion

Again, Study 2 found that people’s decisions about how to allocate money to, and whether to perform a kind act for, a neighbor are consistent with an underlying psychological variable that represents a cost-benefit or ‘welfare tradeoff ratio’.

The study also replicates and extends the findings of Study 1, by showing that MA and, to a lesser extent, KQ choices are more consistent with WTR than with cost or benefit alone. This was the case for three of the four predictions we tested (2 methods x 1 recipient x 2 comparisons). In other words, for the most part, people do not make kind decisions based on cost or benefit alone, but rather on the ratio of cost to benefit. However, KQ choices were more consistent with cost than MA choices were; and, contrary to our prediction, KQ choices were not more consistent with WTR than with cost (indeed, they were slightly less so). Again, this may be because there is something different about monetary and kind-act choices, or it could be a methodological issue arising from the high correlation between cost and cost-benefit ratio in this sample of kind acts. Further research will be needed to distinguish between these possibilities.

These findings further corroborate the theory that people employ a welfare trade-off ratio not only when it comes to explicit and precise monetary costs and benefits, but also, to some extent, to the implicit and imprecise intuited costs and benefits of real-world kind acts.

The study also found that people place a high value on the welfare of neighbors (~0.75), comparable to the value placed on the welfare of colleagues in Study 1. And the KQ showed convergent validity with the MA, divergent validity with general happiness, and good incremental validity over the MA in predicting a general measure of prosociality (PSA).

In summary, the results of Study 2 show that people make consistent decisions about kindness, the level of kindness is high, and the KQ provides a promising new way to measure kindness.

General Discussion

The results of two large studies, using two different measures (MA and KQ), found that people’s decisions about whether to be kind to others are largely consistent with an underlying variable that represents the ratio of the costs an individual is willing to pay to benefit others, a welfare trade-off ratio. The level of consistency was high, substantially higher than would be expected by chance. And the results from the MAs and KQs were comparable, suggesting that people apply the same logic to explicit and precise monetary costs and benefits, and the more implicit and imprecise costs and benefits of real-world kind acts.

The level of kindness (the WTR) was also high—substantially higher than previous research with US student samples, but comparable to larger samples of MTurk users. The results of Study 1 also show that, for money allocation at least, the level of kindness declines with social distance, which is in keeping with previous research (Forster et al., 2017).

These two studies also demonstrated the convergent, divergent and incremental validity of The Kindness Questionnaires. This new measure provides a promising alternative to—and combines the best features of—previous monetary and standard general measures of kindness and related constructs (Baumsteiger & Siegel, 2019; Boxer et al., 2004; Büssing et al., 2013; Canter et al., 2017; Carlo & Randall, 2002; Comunian, 1998; Gherghel et al., 2021; Johnson et al., 1989; Nickell, 1998; Pommier et al., 2020; Rushton et al., 1981; Seligman et al., 2004; Strauss et al., 2016). The Kindness Questionnaires may be more accessible than monetary measures, in that they can be used with innumerate populations, such as children. And they may have more predictive validity than standard general measures, in that estimating an individual’s WTR on a sample of acts (for example, 0.50) could be used to predict their willingness to perform any and all acts from the larger item pool. The Kindness Questionnaires can also be tailored to specific recipients.

However, the studies had a number of limitations. First, all measures were self-report, and involved hypothetical money-allocation and kind acts. Thus, these findings might exaggerate the true value of kindness—although it should be noted that previous research suggests hypothetical money-allocation tasks predict real money allocation (Delton, 2010). Second, The Kindness Questionnaires relied on previously normed cost and benefit data, rather than the estimates of the participants themselves. This may reduce the accuracy of the KQ measures—although the similarity of the WTRs obtained from the MAs and KQs suggests that this may not be a substantial problem. Third, ceiling and floor effects may have contributed to anomalous ratings of costs and benefits, limiting the range of available cost-benefit ratios. Fourth, the Study 1 KQs for different recipients used different items, and a different range of cost-benefit ratios, making the results difficult to compare. Fifth, the cost and cost-benefit ratio of the KQ items were highly correlated, possibly masking the degree to which KQ choices were more consistent with WTR than with cost alone. And sixth, as noted above, the Study 1 KQs for family and friends were at ceiling, which limits their utility and interpretability, and suggests that the current versions of The Kindness Questionnaire should be used to measure kindness to colleagues, strangers and neighbors only.

Future research should overcome these limitations by: first, using and comparing real-world behavioral measures; second, scoring the KQ on participant-provided estimates of costs and benefits, and/or using estimates based on a wider range of more objective currencies (perhaps monetary value, to provide even more direct comparison with MA); third, prefacing ratings of the costs and benefits of kind acts with more practice items, in order to better calibrate the scale by anchoring its upper and lower limits; fourth, creating KQs with a uniform set of items that can be used across different types of recipients; fifth, choosing KQ items that vary more on cost; and sixth, rating the costs and benefits of kind acts on a more expansive scale (for example, 1–100), in order to provide more fine-grained response options, and generate items with a wider and higher range of ratios.

Further work along these lines will advance our understanding of the psychology of kindness.

Notes

1) The Friend KQ had an additional unrated item included in error (a total of 22 items); this item was excluded from subsequent analysis.

2) 12,232 participants started the study. 1,793 did not complete at least 80% of crucial KQ or MA choices. 3,370 were removed due to speeding through the study (Greszki et al., 2015). An additional 437 were removed due to not passing attention/quality checks. 31 people were excluded due to reporting ages lower than 18 or greater than 100. 6,601 participants were used in the final analyses.

3) We calculated the level of consistency expected by chance by generating and scoring random responses to the measures, and choosing the highest, most conservative value.

4) Participants also completed: the same KQ scale but from neighbors; a range of questions about their emotional reactions to their neighbors (“How does your neighbor make you feel? Angry, Happy, Sad, Regretful, Disappointed, Grateful, Guilty, Proud, Ashamed, Afraid, Envious”; rated 1–5: Not at all, A little, A moderate amount, A lot, A great deal); a single-item measure of satisfaction with one’s neighborhood (Neal, 2021); and two open-ended questions: “What is the kindest thing a neighbor has ever done for you?” and “What one thing could your neighbors do to be kinder to you?”. We plan to report these results in future publications.

5) 10,078 people began the study. 27 people did not complete at least 80% of crucial MA and KQ choices. 1,357 people were excluded due to speeding through the study (Greszki et al., 2015). 202 additional people did not pass the attention check, leaving 8,492 participants included in the analyses.

6) The results were much the same when we regressed interdependence, identity and inclusion onto MA and KQ. See Tables S14a–c.

Funding

Thanks to Verizon, Simple Skincare, and Nextdoor for the generous grants that made this research possible.

Acknowledgments

Thanks to Ryan McManus for statistical advice. And thanks to Gabriel Lima and Eva Jahan for research assistance.

Competing Interests

All authors are, or were at one time, employed by kindness.org. Compensation was not dependent on the research findings. No individuals or organizations outside the research team were involved in the study design, data collection, analysis, or interpretation of the results.

Author Contributions

Oliver Scott Curry—Conceptualization | Methodology | Formal analysis | Writing | Supervision. Chloe San Miguel—Methodology | Software | Formal analysis | Data curation. James Wilkinson—Conceptualization | Investigation. Mehmet Necip Tunç—Conceptualization | Investigation | Formal analysis.

Ethics Statement

This research was conducted in accordance with relevant ethical guidelines and regulations. Study 1 was reviewed and deemed exempt under 45 CFR 46.104(d)(3) by the Harvard University-Area Committee on the Use of Human Subjects (IRB Protocol #IRB19-0070). Study 2 was reviewed and determined to be exempt under 45 CFR 46.104(d)(2)(i) & (ii) by Ethical & Independent Review Services (Study ID: 22157). For both studies, participants were presented with an online consent form detailing the study’s purpose, procedures, risks, and their rights. Informed consent was provided by choosing to proceed with the survey. No identifying information was collected, and all data were fully anonymous.

Preregistration Statement

Only Study 2 was preregistered (see Curry et al., 2022); deviations from the preregistered protocol are documented in the supplementary materials.

Data Availability

For this article, data are available (see Curry et al., 2022).

Supplementary Materials

For this article, the following Supplementary Materials are available:

Index of Supplementary Materials

  • Curry, O. S., San Miguel, C., Wilkinson, J., & Tunç, M. N. (2022). Consistent Kindness: Money allocation and kind act decisions are regulated by a ‘welfare trade-off ratio’ [Materials, data, code, analysis, and preregistration of Study 2]. OSF. https://osf.io/zuh59

References

  • Ayers, J. D., Sznycer, D., Sullivan, D., Guevara Beltrán, D., van den Akker, O. R., Muñoz, A. E., Hruschka, D. J., Cronk, L., & Aktipis, A. (2023). Fitness interdependence as indexed by shared fate: Factor structure and validity of a new measure. Evolutionary Behavioral Sciences, 17(3), 259-284. https://doi.org/10.1037/ebs0000300

  • Baumsteiger, R., & Siegel, J. T. (2019). Measuring prosociality: The development of a Prosocial Behavioral Intentions Scale. Journal of Personality Assessment, 101(3), 305-314. https://doi.org/10.1080/00223891.2017.1411918

  • Boxer, P., Tisak, M. S., & Goldstein, S. E. (2004). Is it bad to be good? An exploration of aggressive and prosocial behavior subtypes in adolescence. Journal of Youth and Adolescence, 33(2), 91-100. https://doi.org/10.1023/B:JOYO.0000013421.02015.ef

  • Büssing, A., Kerksieck, P., Günther, A., & Baumann, K. (2013). Altruism in adolescents and young adults: Validation of an instrument to measure generative altruism with structural equation modeling. International Journal of Children’s Spirituality, 18(4), 335-350. https://doi.org/10.1080/1364436X.2013.849661

  • Canter, D., Youngs, D., & Yaneva, M. (2017). Towards a measure of kindness: An exploration of a neglected interpersonal trait. Personality and Individual Differences, 106, 15-20. https://doi.org/10.1016/j.paid.2016.10.019

  • Caprara, G. V., Steca, P., Zelli, A., & Capanna, C. (2005). A new scale for measuring adults’ prosocialness. European Journal of Psychological Assessment, 21(2), 77-89. https://doi.org/10.1027/1015-5759.21.2.77

  • Carlo, G., & Randall, B. A. (2002). The development of a measure of prosocial behaviors for late adolescents. Journal of Youth and Adolescence, 31(1), 31-44. https://doi.org/10.1023/A:1014033032440

  • Comunian, A. L. (1998). The Kindness Scale. Psychological Reports, 83(3, suppl), 1351-1361. https://doi.org/10.2466/pr0.1998.83.3f.1351

  • Curry, O. S., Rowland, L. A., Van Lissa, C. J., Zlotowitz, S., McAlaney, J., & Whitehouse, H. (2018). Happy to help? A systematic review and meta-analysis of the effects of performing acts of kindness on the well-being of the actor. Journal of Experimental Social Psychology, 76, 320-329. https://doi.org/10.1016/j.jesp.2018.02.014

  • Curry, O. S., San Miguel, C., Wilkinson, J., & Tunç, M. N. (in press). The costs and benefits of kindness. Journal of Positive Psychology. https://osf.io/gvfdw/

  • Delton, A. W. (2010). A psychological calculus for welfare tradeoffs [Doctoral dissertation, University of California, Santa Barbara].

  • Delton, A. W., Jaeggi, A. V., Lim, J., Sznycer, D., Gurven, M., Robertson, T. E., Sugiyama, L. S., Cosmides, L., & Tooby, J. (2023). Cognitive foundations for helping and harming others: Making welfare tradeoffs in industrialized and small-scale societies. Evolution and Human Behavior, 44(5), 485-501. https://doi.org/10.1016/j.evolhumbehav.2023.01.013

  • Delton, A. W., & Robertson, T. E. (2016). How the mind makes welfare tradeoffs: Evolution, computation, and emotion. Current Opinion in Psychology, 7, 12-16. https://doi.org/10.1016/j.copsyc.2015.06.006

  • Dunn, E. W., Aknin, L. B., & Norton, M. I. (2008). Spending money on others promotes happiness. Science, 319(5870), 1687-1688. https://doi.org/10.1126/science.1150952

  • Forster, D. E., Pedersen, E. J., Smith, A., McCullough, M. E., & Lieberman, D. (2017). Benefit valuation predicts gratitude. Evolution and Human Behavior, 38(1), 18-26. https://doi.org/10.1016/j.evolhumbehav.2016.06.003

  • Gherghel, C., Nastas, D., Hashimoto, T., & Takai, J. (2021). The relationship between frequency of performing acts of kindness and subjective well-being: A mediation model in three cultures. Current Psychology, 40(9), 4446-4459. https://doi.org/10.1007/s12144-019-00391-x

  • Greszki, R., Meyer, M., & Schoen, H. (2015). Exploring the effects of removing “too fast” responses and respondents from web surveys. Public Opinion Quarterly, 79(2), 471-503. https://doi.org/10.1093/poq/nfu058

  • Johnson, R. C., Danko, G. P., Darvill, T. J., Bochner, S., Bowers, J. K., Huang, Y.-H., Park, J. Y., Pecjak, V., Rahim, A. R. A., & Pennington, D. (1989). Cross-cultural assessment of altruism and its correlates. Personality and Individual Differences, 10(8), 855-868. https://doi.org/10.1016/0191-8869(89)90021-4

  • Kirby, K. (2000). Instructions for inferring discount rates from choices between immediate and delayed rewards [Unpublished manuscript]. Williams College.

  • Mashek, D., Cannaday, L. W., & Tangney, J. P. (2007). Inclusion of community in self scale: A single-item pictorial measure of community connectedness. Journal of Community Psychology, 35(2), 257-275. https://doi.org/10.1002/jcop.20146

  • Neal, Z. (2021). Does the neighbourhood matter for neighbourhood satisfaction? A meta-analysis. Urban Studies, 58(9), 1775-1791. https://doi.org/10.1177/0042098020926091

  • Nickell, G. S. (1998). The helping attitude scale. 106th Annual Convention of the American Psychological Association at San Francisco, 1–10. https://scholar.google.com/scholar?cluster=3907632416234758741&hl=en&oi=scholarr

  • Nielson, M. G., Padilla-Walker, L., & Holmes, E. K. (2017). How do men and women help? Validation of a multidimensional measure of prosocial behavior. Journal of Adolescence, 56, 91-106. https://doi.org/10.1016/j.adolescence.2017.02.006

  • Parker, K., Horowitz, J. M., Brown, A., Fry, R., Cohn, D., & Igielnik, R. (2018, May 22). What unites and divides urban, suburban and rural communities. Pew Research Center’s Social & Demographic Trends Project. https://www.pewresearch.org/social-trends/2018/05/22/what-unites-and-divides-urban-suburban-and-rural-communities/

  • Pommier, E., Neff, K. D., & Tóth-Király, I. (2020). The development and validation of the compassion scale. Assessment, 27(1), 21-39. https://doi.org/10.1177/1073191119874108

  • Postmes, T., Haslam, S. A., & Jans, L. (2013). A single-item measure of social identification: Reliability, validity, and utility. The British Journal of Social Psychology, 52(4), 597-617. https://doi.org/10.1111/bjso.12006

  • Rushton, J. P., Chrisjohn, R. D., & Fekken, G. C. (1981). The altruistic personality and the self-report altruism scale. Personality and Individual Differences, 2(4), 293-302. https://doi.org/10.1016/0191-8869(81)90084-2

  • Seligman, M. E. P., Park, N., & Peterson, C. (2004). The Values In Action (VIA) classification of character strengths. Ricerche di Psicologia, 27, 63-78.

  • Strauss, C., Lever Taylor, B., Gu, J., Kuyken, W., Baer, R., Jones, F., & Cavanagh, K. (2016). What is compassion and how can we measure it? A review of definitions and measures. Clinical Psychology Review, 47, 15-27. https://doi.org/10.1016/j.cpr.2016.05.004

  • Sznycer, D., Delton, A. W., Robertson, T. E., Cosmides, L., & Tooby, J. (2019). The ecological rationality of helping others: Potential helpers integrate cues of recipients’ need and willingness to sacrifice. Evolution and Human Behavior, 40(1), 34-45. https://doi.org/10.1016/j.evolhumbehav.2018.07.005

  • Yarkoni, T. (2022). The generalizability crisis. The Behavioral and Brain Sciences, 45, Article e1. https://doi.org/10.1017/S0140525X20001685