Vocal convergence during formation of social relationships in vampire bats
Julia K. Vrtilek, Grace Smith-Vidaurre, Eric Fosler-Lussier, Rachel A. Page, Gerald G. Carter

TL;DR
Vampire bats develop similar vocalizations when housed together, and this convergence is linked to cooperative relationships like food sharing.
Contribution
Demonstrates experimentally that co-housing and food sharing drive vocal convergence in vampire bats.
Findings
Co-housing causes vocal convergence in vampire bats.
Food-sharing rates among non-kin bats predict increased call similarity.
Cooperative relationships lead to further vocal convergence beyond initial co-housing effects.
Abstract
In many group-living birds and mammals, the formation of affiliative relationships is hypothesized to cause vocal convergence (an increase in call similarity between individuals). However, testing this causal effect can be difficult, because it requires experimentally forming new relationships. Here, we demonstrate convergence in the contact calls of common vampire bats (Desmodus rotundus) that we introduced and experimentally housed together. To estimate and disentangle the roles of kinship, co-housing (familiarity), allogrooming and food sharing in predicting call similarity, we first measured call similarity using 35 features of 693 494 contact calls from 95 bats, then fitted a series of Bayesian generalized multi-membership models. We also measured changes in call similarity for a subset of individuals who were recorded before and after co-housing. We found that co-housing caused…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2|
model |
question addressed |
covariates included as predictors of contact call similarity |
bat pairs in sample |
|---|---|---|---|
|
1 |
how well does kinship predict contact call similarity? |
kinship |
pairs with estimated kinship = 2179 |
|
2 |
how well does kinship predict contact call similarity after controlling for co-housing and affiliation? |
kinship, co-housing (true/false), affiliation rate (including zeros for bats that never met) |
pairs with estimated kinship = 2179 |
|
3 |
in non-kin females, how well does co-housing predict contact call similarity? |
co-housing (true/false) |
pairs of non-kin adult females = 1159 |
|
4 |
in non-kin females, how well does affiliation predict contact call similarity? |
within-group affiliation rate (excluding bats that never met) |
pairs of non-kin adult females that were caged together = 272 |
|
5 |
in non-kin females, how well does food sharing predict contact call similarity? |
within-group food-sharing rate (excluding bats that never met) |
pairs of non-kin adult females with estimated food-sharing rates = 139 |
- —National Science Foundation
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Neuroendocrine regulation and behavior · Primate Behavior and Ecology
Introduction
Vocal behaviour provides an important window into the social lives of many birds and mammals. Exchanges of vocalizations called ‘contact calls’ seem to facilitate recognition, reunions and collective movements among affiliated individuals across several species (e.g. African elephants [1], common marmosets [2] and white-winged vampire bats [3]). In many group-living birds and mammals, the contact calls of unrelated members of the same group sound more similar than those of strangers (e.g. black-capped chickadees [4], orange-fronted conures [5], budgerigars [6], yellow-naped amazons [7], pygmy and common marmosets [8–10], Guinea baboons [11], grey mouse lemurs [12], gelada [13], African elephants [14], bottlenose dolphins [15,16], orcas [17], greater spear-nosed bats [18] and Taiwanese leaf-nosed bats [19]). Based on this observation, a longstanding hypothesis is that the formation of close affiliative relationships (‘social bonds’) can increase similarity in the acoustic structure of calls between individuals (‘vocal convergence’) [20].
Although vocal similarity or convergence is often associated with affiliative relationships [10,18,19,21–25], there are multiple possible mechanisms for how and why vocal convergence occurs. First, familiar animals might have the ability to modify the structure of vocalizations based on auditory inputs (vocal production learning), causing similarity between familiar individuals (e.g. [4]). However, vocal convergence exists in many mammalian lineages where vocal production learning has not yet been demonstrated [26]. In these cases, it is possible that animals might be learning from conspecifics to produce existing sounds in their repertoire in new contexts (usage learning). Finally, familiar animals might use the same vocalizations more often simply because their physiological or motivational states often match [26].
Regardless of the underlying mechanism, demonstrating a causal link from affiliation to vocal convergence can be difficult. For example, correlations between affiliation and vocal similarity can result from kinship, through either heritable variation or shared environment. The same correlation can result from pre-existing vocal similarity causing higher affiliation (assortment based on acoustic similarity). To demonstrate causation, an ideal study would induce new relationships to form among non-kin, then observe changes in call similarity (e.g. [4]).
Here, we applied this approach to demonstrate convergence in the contact calls of common vampire bats (Desmodus rotundus). Vampire bats produce contact calls when they are socially isolated [27] and use the calls to vocally recognize affiliated individuals [28]. By analysing 693 494 contact calls from 95 bats, some of which were experimentally co-housed together, we found that co-housing causes vocal convergence. Next, to tease apart how pairwise call similarity was shaped by relationships within each captive group, we estimated effects of genetic relatedness, allogrooming and food sharing [29–32]. In this species, both kin and non-kin females develop reciprocal allogrooming relationships that can become reciprocal food-sharing relationships, in which a fed bat regurgitates potentially life-saving ingested blood to a starving partner [29–31]. We found evidence that these food-sharing relationships cause further vocal convergence beyond the initial convergence caused by co-housing.
Methods
Study subjects
(a)
The 95 common vampire bats (D. rotundus) used in this study came from 8 different origins (e.g. captive colonies or field sites; for details, see electronic supplementary material, table S1). To identify individual bats, all bats were marked using 1–4 unique bat bands. We labelled bats as ‘adults’ if they were older than 2 years old at time of recording or, if their exact age was unknown, by the presence or absence of an epiphyseal gap between their metacarpals. We refer to volant non-adults as ‘subadult’. Non-volant pups that were still nursing from their mother were not included in the study.
Kinship, social familiarity and cooperative relationships
(b)
Bats captured from different sites hundreds of kilometres apart were considered both unrelated (non-kin) and unfamiliar. Kinship of bats from the same colony was estimated through a combination of microsatellite markers and maternal pedigrees, as described in previous studies [29,31]. Pairs of bats with unknown kinship were excluded from analyses where kinship was a predictor.
Most pairs (n = 2605) were unfamiliar because they had never met; 582 pairs were captured from different field sites hundreds of kilometres apart, then introduced in captivity and caged together (co-housed) for at least 4 months; 612 pairs were captured from the same field site then caged together for at least 4 months; and 435 pairs were living together in captivity for years. Another 231 pairs had unknown levels of kinship and familiarity, because they were captured from the same wild roost about four years apart.
Within each co-housed captive group, pairs had variable rates of cooperative behaviour. We had measures of food-sharing rates for 307 pairs, allogrooming rates for 190 pairs, and both measures for 406 pairs. These data were collected from a series of past studies where experimenters repeatedly induced food sharing by removing a bat, fasting it for over 24 h and reintroducing it to its home cage, then used focal follow sampling to observe allogrooming, regurgitated food sharing, or both [29–31].
Recording contact calls from individually isolated bats
(c)
Contact calls are frequency-modulated syllables that are produced by socially isolated bats [27]. To record contact calls, we isolated an individual bat inside a plastic storage bin (about 75–200 l in volume) lined with acoustic foam and/or soft fabric to dampen echoes. To keep the bat within 10–30 cm of the microphone, we placed it inside a small soft-mesh butterfly cage or a tube of plastic mesh. Calls were recorded with an Avisoft CM16 ultrasound condenser microphone (frequency range 10–200 kHz, purchased prior to 2017 from Avisoft Bioacoustics, Berlin, Germany). Sound was digitized with 16-bit resolution at sampling rates ranging between 250 and 500 kHz through an Avisoft UltraSoundGate (one-channel or four-channel) to a laptop running the program Avisoft Recorder. To remove any effects of digitization rate on subsequent call measures, recordings digitized at a rate above 250 kHz were downsampled to 250 kHz using the seewave R package [33]. To filter out sounds that were not contact calls, we removed possible calls with durations less than 3 ms or longer than 50 ms and with peak frequencies below 10 kHz. For further explanation of call selection, see electronic supplementary material, methods.
We analysed 693 494 contact calls from 95 of the 115 common vampire bats that were individually isolated and recorded from 2011 to 2019. We only analysed the 95 bats that produced at least 100 contact calls of sufficient quality to measure acoustic features (described below). The number of recording sessions (nights) per bat varied from 1 to 22 (mean = 7.36). The mean number of calls recorded per bat was 7300 (median = 2500, minimum = 118, maximum = 98 161).
Measuring acoustic features of calls
(d)
We wrote a custom R script to identify start and end times of individual calls using both amplitude and spectral density (the distribution of amplitude across frequencies). This automated method was tested and validated with 1535 manually labelled recordings containing ~7000 call selections. Using these start and end times, we then measured 35 features of each call (figure 1; electronic supplementary material, table S2). We used the R package warbleR [34] to measure 27 spectral and temporal parameters and the R package soundgen [35] to estimate 8 additional measurements from the fundamental: maximum slope, minimum slope, absolute minimum slope; the numbers of positive slopes and of negative slopes; the number of turns; the mean slope; and the number of segments. We eliminated ‘clipped’ calls, which we identified as calls that reached a relative amplitude of more than 0.99 using the normalize function from the R package tuneR [36]. All scripts were run using the Ohio Supercomputer [37].
Vampire bat contact call. Call measures shown here include 1 duration, 2 time quantile 25%, 3 time median, 4 time quantile 75%, 5 mean frequency, 6 frequency quantile 25%, 7 frequency median, 8 frequency quantile 75%, 9 maximum dominant frequency, 10 mean dominant frequency, 11 minimum dominant frequency, 12 start dominant frequency, 13 end dominant frequency and 14 peak frequency. Electronic supplementary material, table S2, defines the above measures and 21 additional measures not shown here.
Assessing vocal individuality
(e)
To assess and confirm baseline individuality in contact call structure, we estimated the classification rate for assigning calls to the correct caller using a linear discriminant function analysis (DFA) with leave-one-out cross-validation, including all variables measured (via the MASS R package [38]). To choose a minimum number of calls per bat, we compared these classification rates to the same rates from a single DFA without cross-validation, and we excluded bats with fewer than 100 sampled calls because within-bat classification rates were virtually identical for bats with greater than 100 calls. To confirm that calls could be assigned to the caller with greater-than-chance accuracy, we ran a permutation test: we repeated a DFA without cross-validation 1000 times with randomized bat names to generate an expected distribution of accuracy rates under the null hypothesis that calls contain no information about caller identity.
Estimating contact call similarity
(f)
We defined contact call similarity (S) for each pair of bats as
where D is the Mahalanobis distance between two group centroids (bats) from a DFA classifying calls (n = 693 494) to bat (n = 95). These Mahalanobis distances represent the average multivariate distance (or dissimilarity) between every two bats. We normalized our measure of call similarity to range from 0 to 1; however, we replaced the single similarity score of 0 with 0.001 (the next lowest call similarity was 0.03), because similarity cannot be precisely zero and beta regression cannot model an outcome of zero.
Estimating effects of kinship and social relationship on call similarity
(g)
To assess predictors of contact call similarity while controlling for the effect of both bat identities, we fitted five Bayesian generalized (beta distribution) multi-membership models (table 1) using the R package brms [39]. Multi-membership models are used to model dyadic data by fitting group-structured random effects, which capture the dependence of each pair on the identities of both individuals. This model structure allowed us to estimate a random intercept for each bat and its influence on its pairings with other bats [40]. To assess causal hypotheses, we estimated multiple predictors across seven models based on known and hypothesized causal paths (figure 2a).
Contact call similarity in vampire bats is predicted by co-housing, affiliation rate and food-sharing. (a) Causal model for vocal convergence, with black arrows indicating causal paths of interest, and grey arrows indicating causal paths that cause potential confounding. Path A: greater kinship causes greater affiliation. We estimate affiliation through allogrooming and food-sharing. Path B: kinship might cause greater similarity in contact calls if call features are heritable. Path C: affiliation might cause vocal convergence. Path D: affiliation is caused by co-housing. Path E: co-housing leads to acoustic exposure, which might also cause vocal convergence independent of variation in social affiliation. (b) Bayesian posterior probability distributions for seven regression coefficients from five models (colours) selected to disentangle causal effects of kinship, co-housing, affiliation rate and food-sharing. Dots and error bars show means and 95% credible intervals. See electronic supplementary material, table S3, for values of coefficients and credible intervals.
Our first goal was to disentangle the effects of kinship from social familiarity. In model 1, we estimated how well kinship predicted contact call similarity across the 2179 pairs of adult bats with known kinship estimates. Kinship is confounded with group-level familiarity and within-group affiliation rates, and any effects of kinship in model 1 could therefore be driven by social interaction history rather than kinship. In model 2, we therefore estimated how well kinship predicted contact call similarity when conditioning on (i.e. statistically controlling for) affiliation rates in the same 2179 pairs. Predictors were kinship, whether the bats were co-housed (true/false), and within-group affiliation lograte, defined as
where ‘observed’ was the seconds of either allogrooming or regurgitated food sharing between the two bats and ‘sampling’ was the total seconds in which allogrooming or food sharing could have been observed between the two bats. Both allogrooming and food sharing are directed behaviours that indicate an affiliative social preference (discussed further below). We used logrates rather than rates because seconds of allogrooming and food sharing are lognormal. We scaled (i.e. z-transformed) both kinship and affiliation lograte to convert units to standard deviations.
Another way to control for kinship is to only test unrelated pairs. In model 3, we measured how well being caged together (true/false) predicted contact call similarity in the 1159 pairs of unrelated female adults caught from different sites. All pairs had kinship estimates of 0, allowing us to test for an effect of non-kin being caged together on their contact call similarity. We focused on female pairs because the few males in these colonies were sub-adults or young born into the group.
Our next goal was to separate the effects of group-level familiarity and dyad-level affiliation rate among non-kin females. In model 4, we measured how well within-group affiliation (scaled lograte) predicted contact call similarity in 272 unrelated pairs of female adults that were caged together. We removed pairs with kinship estimates greater than 0.05 to test for an effect of non-kin social interactions on contact call similarity. We also removed pairs that could never have been in contact. We again focused on female pairs because social affiliation data were focused on females and undersampled or missing for males [29–32]. We did not include allogrooming and allofeeding as separate covariates in model 4 because we believe they are highly correlated proxies for social affiliation, rather than two separate effects. The affiliation rate therefore includes either allogrooming, regurgitated food sharing, or both. The results of model 4 must therefore be interpreted carefully, because affiliation rate is based more on allogrooming in some pairs than others. For example, in one colony from 2010 to 2014, food-sharing rates were sampled more than allogrooming, but in another colony in 2019, food-sharing rates were unknown (missing) for all pairs. Moreover, food sharing represents a much higher level of affiliation than allogrooming. For example, when bats are introduced in captivity (i.e. become familiar), most pairs form allogrooming relationships and only a few begin food-sharing relationships [29]. In other words, co-housing predicts allogrooming better than food sharing.
To remedy these issues, we used our last model to focus on food-sharing rates, which is the best indication of a within-group social bond [29]. In model 5, we measured how well food-sharing (scaled lograte) predicted contact call similarity in the 139 unrelated pairs of female adults that were caged together. We again removed pairs with kinship estimates greater than 0.05, pairs that could never have shared food and pairs with unknown food-sharing rates.
Across all models, we used the default priors in brms and scaled continuous predictors. We used posterior predictive checks to assess model fit (electronic supplementary material, figure S2). To ensure Markov chain Monte Carlo (MCMC) convergence, we checked the R-hat values across 4 chains, which ranged from 0.9999 to 1.0004, showing that all 4 chains reached the same estimates for coefficients. We used chain and warmup lengths of 6000 and 2000 samples, respectively. The lowest Bulk and Tail effective sample sizes across all models were 8747 and 10 064, respectively.
Comparing call similarity pre-introduction and post-introduction
(h)
We could determine how call similarity changed over time using a subset of cases in 2016 where seven bats from Tolé were recorded both before and after being introduced to five bats from Las Pavas (electronic supplementary material, figure S3). We predicted that contact call convergence (increases in call similarity) should be greater for introduced pairs (from different sites) than for previously familiar pairs (from the same site). To compare call similarity before and after introduction, we first performed two DFAs classifying calls to bats. Both DFAs included five Las Pavas bats and seven Tolé bats, but the first DFA included recordings from the Tolé bats before introduction to the Las Pavas bats, and the second DFA included the same Tolé bats recorded after the introduction. From each DFA, we extracted Mahalanobis distances and call similarities as described above. We then estimated the change in similarity as the difference between post-introduction and pre-introduction call similarities.
To test the hypothesis that increases in similarity were greater for introduced pairs than for familiar pairs, we used two approaches. First, we used a non-parametric permutation test (Mantel test with Spearman correlation) of the correlation between the change in contact call similarity and a binary familiarity matrix indicating familiar versus introduced pairs. Second, we used a Bayesian multi-membership model (Gaussian distribution, default priors in brms, 4 chains with length of 6000 interactions, 2000 warmup samples) to test whether familiarity predicted the change in contact call similarity.
Finally, we tested the prediction that call classification to the correct capture site should be more accurate prior to the merging of the two groups. To do this, we used two permuted DFAs, which are designed to test for correct classification of calls to social group [41]. In the first permuted DFA, we classified calls to either the Las Pavas group or the pre-introduction Tolé group. In the second permuted DFA, we classified calls to either the Las Pavas group or the post-introduction Tolé group.
Results
Contact call variation within and between individuals
(a)
Contact calls had clear individual signatures but were also highly variable within bat (electronic supplementary material, figure S1). DFA correctly assigned the 693 494 calls to the 95 callers with 36% probability, which is more than twice the expected chance level of 14% (95% quantiles of expected values: 14.14–14.15%). The call assignment accuracy of assigning calls back to individual bats from the cross-validated DFA and from the non-cross-validated DFAs was almost perfectly correlated across the 95 bats (r = 0.9996), indicating robustness of the DFA used for estimating call similarity.
Kinship-biased contact call similarity can be explained by social familiarity
(b)
Model 1 revealed that kinship predicts contact call similarity (figure 2b; electronic supplementary material, figure S4). However, kinship is confounded with social affiliation in vampire bats [30,31,42]. We therefore used model 2 to test for an effect of kinship while conditioning on (i.e. ‘controlling for’) affiliation rate and co-housing. We no longer found an effect of kinship while conditioning on these measures, but we did find that call similarity was predicted by both affiliation rate and co-housing while conditioning on kinship and each other (figure 2b). This result does not mean that kinship did not influence call similarity; only that the correlations between kinship and contact call similarity can also be explained by social familiarity. In other words, both affiliation rate and familiarity predicted contact call similarity, regardless of kinship.
Co-housing causes contact call similarity among non-kin
(c)
To further test whether co-housing causes contact call convergence among non-kin, we used model 3 to focus on 1159 pairs of unrelated female adults that were captured from different sites. Compared to model 2, which included kin, we found that co-housing was associated with an even larger increase in call similarity among pairs of unrelated adult females (figure 2b). As expected, call similarity for non-kin from distant sites that were housed together in captivity was dramatically higher than non-kin from different sites that never met, and the highest mean similarity in non-kin contact calls occurred within non-kin pairs from the same wild roost or in the same long-term captive colony (electronic supplementary material, figure S5).
Within co-housed groups, food-sharing rates predict further vocal convergence
(d)
Although model 2 revealed an effect of affiliation (allogrooming and food sharing; electronic supplementary material, figure S6), further analyses showed that this effect was driven by food sharing rather than allogrooming (figure 2b; electronic supplementary material, figure S7). Specifically, model 4 estimated the effect of within-group dyadic affiliation rate on call similarity among unrelated female adults, and this model found no clear effect of affiliation rate beyond the effect of previous familiarity. However, model 5 showed that contact call similarity was predicted by within-group rates of regurgitated food sharing. This effect of food sharing on call similarity was largely driven by the 2014 long-term captive colony (electronic supplementary material, figure S7), which had the highest rates of food sharing and the longest periods of familiarity.
Vocal convergence is evident from call similarity before and after co-housing
(e)
We saw evidence for vocal convergence over time (increased contact call similarity after introduction) in 25 of 35 pairs from different sites (71%), compared to 11 of 21 familiar pairs from Tolé (52%). We found that a greater increase in call similarity (i.e. greater vocal convergence) was positively associated with being an introduced pair of bats from different sites (Mantel test: Spearman’s correlation = 0.28, n = 12 bats, p = 0.011; Bayesian multi-membership model: coefficient for being introduced = 0.06, Bayesian 95% credible interval = 0.02–0.11). Consistent with vocal convergence, the ability to correctly classify calls to capture site (Las Pavas or Tolé) decreased from 72% to 65% after bats from both capture sites were merged into one colony.
Discussion
Variation in social affiliation drives convergence of contact calls in vampire bats
(a)
For this study, we analysed a 9-year collection of contact calls recorded from a large sample of vampire bats for which we had known kinship and social interaction histories. We observed multiple cases of contact call convergence within both kin and non-kin pairs with different levels of familiarity (from completely unfamiliar to long-term co-housing) and relationship development (from mere familiarity to reciprocal food-sharing relationship). Using these data, we demonstrated that familiarity, which was induced by co-housing bats in captivity, causes vocal convergence. We showed that the correlation between kinship and call similarity can be explained by social affiliation among kin. Interestingly, food-sharing rates among familiar non-kin within the same co-housed group predicted contact call similarity, even beyond the effect of familiarity caused by co-housing. These findings are the first clear evidence of vocal convergence in vampire bats, and the first evidence of vocal convergence in bats based on within-group variation in cooperative relationships.
The mean levels of call similarity among non-kin were exactly what one would predict if familiarity causes vocal convergence (electronic supplementary material, figure S5). We observed the lowest contact call similarity among non-kin that never met and clearly higher call similarity among non-kin that were captured from different wild roosts and then co-housed in captivity. We observed the highest levels of call similarity equally among wild-caught non-kin from the same roost that were co-housed together, and among captive-born non-kin co-housed in the same long-term captive colony for years (electronic supplementary material, figure S5).
Food sharing was associated with vocal convergence, but unexpectedly, we found that call similarity among co-housed bats was not predicted by allogrooming. This finding was clear from the difference between the effect of overall affiliative behaviour (allogrooming and food sharing) and the effect of only food-sharing rate (figure 2b; electronic supplementary material, figures S7 and S8). One interpretation of this result is that introduced (co-housed) bats vocally converge to some degree, and then additional convergence occurs for the subset of bat pairs with even stronger bonds (indicated by frequent food sharing). Compared to food-sharing relationships, allogrooming relationships are less stable and less selective. All food-sharing relationships include allogrooming but not all allogrooming relationships involve food sharing, and when we co-housed previously unfamiliar adult female vampire bats for almost a year, 78% of those pairs began allogrooming while only 16% of them began food sharing [29]. Moreover, the allogrooming relationships in this study were changing more than in a typical colony due to sickness behaviour from infections [43] and experimental manipulations such as forced proximity [29,32]. Our analyses averaged these complex dynamics over the course of an entire study into a single mean pairwise allogrooming rate, which might not necessarily reflect the observed call similarity for the two bats that were each recorded at different times. In contrast, food-sharing rates in a long-term captive colony were reliable, strong and stable indicators of a close social bond.
Variation in affiliation and vocal convergence in other species
(b)
We observed vocal convergence in vampire bats at two levels: group-level familiarity (co-housing) and within-group close social bonds (food-sharing relationships). We believe this to be the first evidence of partner-specific vocal convergence in bats, in contrast with convergence on a group-level signature, which has been observed in the harsh broadband ‘screech’ contact calls of greater spear-nosed bats (Phyllostomus hastatus) [18] and the tonal constant-frequency echolocation calls of Taiwanese leaf-nosed bats (Hipposideros terasensis) [19]. Outside bats, partner-specific vocal convergence has been reported in multiple birds and mammals, including in bottlenose dolphins [15,44], orange-fronted conures [45], African elephants [46] and Campbell’s monkeys [23].
Vocal convergence at multiple levels of association (e.g. relationship and group), as we detected here, has also been observed in several other avian and mammalian vocal learners. In budgerigars, males entering a new flock adapt their social call to match the group [47], and mated pairs develop a shared call [48]. Monk parakeets show acoustic convergence at the pair, flock and site social scales [49]. In crossbills, ecologically diverged but sympatric groups have discrete call variants, and some bonded pairs produce nearly identical call structures [50]. In mammals, evidence of multi-level vocal convergence has come from cetaceans (e.g. [15,16,44]) and non-human primates (e.g. [8,11,24]). Bottlenose dolphins have individual signature whistles [15,44], but one group has also been shown to share a stereotyped whistle contour [16]. Male Guinea baboons live in a multilevel society and individuals have more similar grunts within gangs (the largest group size) than between gangs, and within a party (several parties to a gang) than between parties [11]. In the pygmy marmoset Cebuella pygmaea, there is strong evidence for convergence at the pair level and weak evidence at the group level: individuals converge the duration and structure of their trill calls in response to pairing with a new mate, but only converge duration when introduced to a group of strangers [8,24]. In general, vocal convergence at multiple levels of social familiarity may be widespread, but there is not yet a clear pattern in how or why it is used by callers and receivers.
Is vocal convergence an adaptive social behaviour or a byproduct of vocal plasticity?
(c)
Many authors argue that vocal learning requires complex and therefore costly neurological adaptations and is therefore unlikely to evolve unless there is a significant fitness advantage [51]. One possible explanation for vocal convergence in vampire bats and other echolocating bats is that vocal plasticity evolved for echolocation, and echolocating bats might alter their calls based on auditory inputs as a byproduct of audio–vocal feedback mechanisms that function for echolocation. According to this hypothesis, vocal convergence does not serve a social function and instead only results from variation in an individual’s exposure to the calls of other individuals. However, this hypothesis is contradicted by our findings: captive vampire bats that could always hear all colony members in their cages converged with food-sharing partners more than with other cagemates. By contrast, a simple exposure effect would lead only to group-level convergence on calls made by the most vocal bats. Instead, we observed biased vocal convergence towards specific group members depending on the relationships between them. Future work could further assess this hypothesis by comparing how social contact (clustering, allogrooming, food sharing), as distinguished from acoustic contact alone, influences call similarity.
Vocal convergence in vampire bats might have social causes but not social consequences, or alternatively, there may be social consequences that influence the development of cooperative relationships. For example, it may be a form of behavioural coordination—a key feature of the formation of many social bonds (e.g. coordinated courtship behaviours, vocal exchanges and turn-taking, and reciprocal allogrooming). If partner-specific vocal convergence requires time, it might act as an honest signal of affiliation and investment. On the other hand, if vampire bats can rapidly imitate the calls they hear, then call matching may be used to address specific receivers. For now, the functions of vocal convergence remain unclear.
Are vampire bats true ‘vocal learners’?
(d)
Definitions of vocal production learning differ among authors. Some authors include modifying extant call types (which is often difficult to distinguish from vocal usage learning), sometimes called ‘limited vocal learning’. Others emphasize the acquisition of entirely new call types (which is the best possible evidence for vocal learning), sometimes called ‘complex vocal learning’ [26,52–57]. Complex vocal production learning is rare among non-human mammals, but several non-human species—including cetaceans, pinnipeds, elephants and several species of bats—can copy artificial or novel sounds, including human speech [14,26,52,58–63].
Our findings show that vampire bats are at least ‘limited’ vocal learners, in the sense that social interactions cause increased similarity in either call structure, usage, or both. Although our data are consistent with the hypothesis that adult vampire bats are capable of vocal production learning, further study on variation in call syllables and responses to playbacks would be necessary to assess whether vampire bats are modifying the acoustic features of existing calls in response to auditory input [20,52,53,55]. Interestingly, the line between convergence in syllable structure versus syllable usage becomes blurred if contact call variation in vampire bats is not discrete but rather continuously variable. Future work could further delineate contact calls into different syllable types and assess the amount of convergence and individuality within these subjectively defined ‘call types’.
Evidence consistent with vocal learning has been found in at least 11 species of bats, distributed across 8 families [63]. Only six of these species have shown direct evidence of vocal learning [18,19,59–61,64,65]. In the pale spear-nosed bat (Phyllostomus discolor), adults can shift the fundamental frequency of their tonal social calls towards a downward-pitch-shifted version of their calls [60], and pups change their isolation calls to resemble their mother’s individual frequency modulation over time [64]. In the greater spear-nosed bat (P. hastatus) individuals changed frequency and temporal characteristics of noisy, broadband ‘screech calls’ to converge on a new group signature as their group composition changed [18]. In Taiwanese leaf-nosed bats (H. terasensis), adult bats in the same colony shifted their echolocation calls in the same direction gradually throughout the year, and individuals that joined the colony shifted to match the colony’s mean call frequency [19]. In the greater horseshoe bat (Rhinolophus ferrumequinum), pups shifted towards the frequency of their mother’s echolocation calls more than to other group members [65]. In Egyptian fruit bats (Rousettus aegyptiacus), pups raised in normal conditions narrowed their vocal repertoire to fewer adult vocalizations, while experimentally isolated pups lagged dramatically in call development or never developed adult calls [61]. In the greater sac-winged bat (Saccopteryx bilineata), pups of both sexes learnt territorial song syllables from adult males [22,59,66]. In general, echolocating bats are remarkably specialized for flexibility in vocal production [52,60] and have evolved sophisticated audio–vocal systems that allow them to adapt their echolocation calls in response to acoustic inputs [67–70]. Given the exceptional vocal flexibility of bats, the evidence for vocal learning across the bat phylogeny, and the fact that bat vocal repertoires are large and not well understood [68], it is possible that all bats are capable of vocal convergence or even vocal production learning.
Conclusion
Manipulations of co-housing in captivity showed that social familiarity causes convergence in contact call structure in vampire bats. Further vocal convergence occurs among familiar female bats that form close social bonds, indicated by dyadic rates of food sharing. Although captive vampire bats in our studies were acoustically exposed to all colony members in their cages, they appeared to converge with food-sharing partners more than other familiar cagemates, suggesting that vocal convergence is not completely explained by mere exposure. The social causes and consequences of this newly discovered vocal convergence have yet to be fully explored.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Soltis J, Leong K, Savage A. 2005 African elephant vocal communication I: antiphonal calling behaviour among affiliated females. Anim. Behav. 70, 579–587. (10.1016/j.anbehav.2004.11.015) · doi ↗
- 2Miller CT, Beck K, Meade B, Wang X. 2009 Antiphonal call timing in marmosets is behaviorally significant: interactive playback experiments. J. Comp. Physiol. A 195, 783–789. (10.1007/s 00359-009-0456-1)PMC 378789819597736 · doi ↗ · pubmed ↗
- 3Carter GG, Fenton MB, Faure PA. 2009 White-winged vampire bats (Diaemus youngi) exchange contact calls. Can. J. Zool. 87, 604–608. (10.1139/Z 09-051) · doi ↗
- 4Mammen DL, Nowicki S. 1981 Individual differences and within-flock convergence in chickadee calls. Behav. Ecol. Sociobiol. 9, 179–186. (10.1007/BF 00302935) · doi ↗
- 5Balsby TJS, Bradbury JW. 2009 Vocal matching by orange-fronted conures (Aratinga canicularis). Behav. Processes 82, 133–139. (10.1016/j.beproc.2009.05.005)19501135 · doi ↗ · pubmed ↗
- 6Dahlin CR, Young AM, Cordier B, Mundry R, Wright TF. 2014 A test of multiple hypotheses for the function of call sharing in female budgerigars, Melopsittacus undulatus. Behav. Ecol. Sociobiol. 68, 145–161. (10.1007/s 00265-013-1631-5)24860236 PMC 4031240 · doi ↗ · pubmed ↗
- 7Wright TF. 1997 Regional dialects in the contact call of a parrot. Proc. R. Soc. Lond. B 263, 867–872. (10.1098/rspb.1996.0128) · doi ↗
- 8Elowson AM, Snowdon CT. 1994 Pygmy marmosets, Cebuella pygmaea, modify vocal structure in response to changed social environment. Anim. Behav. 47, 1267–1277. (10.1006/anbe.1994.1175) · doi ↗
