scholar: Analyse citation data from Google Scholar

Retrieving basic information

## Define the id for Richard Feynman
id <- 'B7vSqZsAAAAJ'

## Get his profile
l <- get_profile(id)

## Print his name and affiliation
l$name
## [1] "Richard Feynman"
l$affiliation
## [1] "California Institute of Technology"
## Print his citation index
l$h_index
## [1] 62
l$i10_index
## [1] 100

Retrieving publications

get_publications() return a data.frame of publication records. It contains information of the publications, including title, author list, page number, citation number, publication year, etc..

The pubid is the article ID used by Google Scholar and the identifier that is used to retrieve the citation history of a selected publication.

## Get his publications (a large data frame)
p <- get_publications(id)
head(p, 3)
##                                    title
## 1 Quantum mechanics and path integration
## 2        The Feynman lectures on physics
## 3      Simulating physics with computers
##                                         author
## 1                         RP Feynman, AR Hibbs
## 2 RP Feynman, RB Leighton, M Sands, SB Treiman
## 3                                   RP Feynman
##                                        journal          number cites year
## 1                                  McGraw–Hill                 36040 1965
## 2                                Physics Today          17, 45 22464 1964
## 3 International journal of theoretical physics 21 (6), 467-488 13165 1982
##                                                                                  cid
## 1                      18279534692101459088,12549707430555464374,3494294362048429920
## 2 18279534692101459088,15649786516955137750,13112633961799421939,3669163321259408309
## 3                                                               15599256484525608168
##          pubid
## 1 hMod-77fHWUC
## 2 u-x6o8ySG0sC
## 3 d1gkVwhDpl0C

Retrieving citation data

## Get his citation history, i.e. citations to his work in a given year
ct <- get_citation_history(id)

## Plot citation trend
library(ggplot2)
ggplot(ct, aes(year, cites)) + geom_line() + geom_point()

Users can retrieve the citation history of a particular publication with get_article_cite_history().

## The following publication will be used to demonstrate article citation history
as.character(p$title[1])
## [1] "Quantum mechanics and path integration"
## Get article citation history
ach <- get_article_cite_history(id, p$pubid[1])

## Plot citation trend
ggplot(ach, aes(year, cites)) +
    geom_segment(aes(xend = year, yend = 0), size=1, color='darkgrey') +
    geom_point(size=3, color='firebrick')
## Warning: Using `size` aesthetic for lines was deprecated in ggplot2 3.4.0.
## ℹ Please use `linewidth` instead.
## This warning is displayed once every 8 hours.
## Call `lifecycle::last_lifecycle_warnings()` to see where this warning was
## generated.

Comparing scholars

You can compare the citation history of scholars by fetching data with compare_scholars.

# Compare Feynman and Stephen Hawking
ids <- c('B7vSqZsAAAAJ', 'qj74uXkAAAAJ')

# Get a data frame comparing the number of citations to their work in
# a given year
cs <- compare_scholars(ids)

## remove some 'bad' records without sufficient information
cs <- subset(cs, !is.na(year) & year > 1900)
ggplot(cs, aes(year, cites, group=name, color=name)) + geom_line() + theme(legend.position="bottom")

## Compare their career trajectories, based on year of first citation
csc <- compare_scholar_careers(ids)
ggplot(csc, aes(career_year, cites, group=name, color=name)) + geom_line() + geom_point() +
    theme(legend.position=c(.2, .8))
## Warning: A numeric `legend.position` argument in `theme()` was deprecated in ggplot2
## 3.5.0.
## ℹ Please use the `legend.position.inside` argument of `theme()` instead.
## This warning is displayed once every 8 hours.
## Call `lifecycle::last_lifecycle_warnings()` to see where this warning was
## generated.

Visualizing and comparing network of coauthors

# Be careful with specifying too many coauthors as the visualization of the
# network can get very messy.
coauthor_network <- get_coauthors('amYIKXQAAAAJ&hl', n_coauthors = 7)

coauthor_network
##                        author                  coauthors
## 1  Herman G. Van De Werfhorst                  Thijs Bol
## 2  Herman G. Van De Werfhorst            Daniele Checchi
## 3  Herman G. Van De Werfhorst                 Sara Geven
## 4  Herman G. Van De Werfhorst         István György Tóth
## 5  Herman G. Van De Werfhorst         Frank Van Tubergen
## 6  Herman G. Van De Werfhorst Jonathan Jan Benjamin Mijs
## 7  Herman G. Van De Werfhorst              Jaap Dronkers
## 8                   Thijs Bol Herman G. Van De Werfhorst
## 9                   Thijs Bol         Arnout Van De Rijt
## 10                  Thijs Bol             Andrea Forster
## 11                  Thijs Bol            Mathijs De Vaan
## 12                  Thijs Bol                Bram Lancee
## 13                  Thijs Bol     Christina Ciocca Eller
## 14                  Thijs Bol              Jaap Dronkers
## 15            Daniele Checchi        Massimiliano Bratti
## 16            Daniele Checchi           Giorgio Brunello
## 17            Daniele Checchi              Vito Peragine
## 18            Daniele Checchi             Marco Leonardi
## 19            Daniele Checchi Herman G. Van De Werfhorst
## 20            Daniele Checchi               Elena Meschi
## 21            Daniele Checchi            Carlo V. Fiorio
## 25                 Sara Geven              About Scholar
## 26                 Sara Geven                Search Help
## 27         István György Tóth Herman G. Van De Werfhorst
## 28         István György Tóth                Brian Nolan
## 29         István György Tóth            Daniele Checchi
## 30         István György Tóth                   Ive Marx
## 31         István György Tóth            Wiemer Salverda
## 32         István György Tóth               Tamás Keller
## 33         István György Tóth             Orsolya Lelkes
## 34         Frank Van Tubergen                 Ineke Maas
## 35         Frank Van Tubergen           Matthijs Kalmijn
## 36         Frank Van Tubergen            Agnieszka Kanas
## 37         Frank Van Tubergen                Sanne Smith
## 38         Frank Van Tubergen Herman G. Van De Werfhorst
## 39         Frank Van Tubergen            Marcel Coenders
## 40         Frank Van Tubergen             Jan O. Jonsson
## 41 Jonathan Jan Benjamin Mijs Herman G. Van De Werfhorst
## 42 Jonathan Jan Benjamin Mijs               Maurice Crul
## 43 Jonathan Jan Benjamin Mijs            Christopher Hoy
## 44 Jonathan Jan Benjamin Mijs           Willem De Koster
## 45 Jonathan Jan Benjamin Mijs        Jeroen Van Der Waal
## 46 Jonathan Jan Benjamin Mijs               Bowen Paulle
## 47 Jonathan Jan Benjamin Mijs                Noam Gidron
## 48              Jaap Dronkers              Juho Härkönen
## 49              Jaap Dronkers Herman G. Van De Werfhorst
## 50              Jaap Dronkers           Marloes De Lange
## 51              Jaap Dronkers                Bram Lancee
## 52              Jaap Dronkers          Gerbert Kraaykamp
## 53              Jaap Dronkers               Maarten Vink
## 54              Jaap Dronkers             Stéfanie André

And then we have a built-in function to plot this visualization.

plot_coauthors(coauthor_network)
## Warning: Removed 35 rows containing missing values or values outside the scale range
## (`geom_point()`).

Note however, that these are the coauthors listed in Google Scholar profile and not coauthors from all publications.

Formatting publications for CV

The format_publications function can be used for example in conjunction with the vitae package to format publications in APA Style. The short name of the author of interest (e.g., of the person whose CV is being made) can be highlighted in bold with the author.name argument. The function after the pipe allows rmarkdown to format them properly, and the code chunk should be set to results = "asis".

APA style

format_publications("NrfwEncAAAAJ", "R Thériault") |> cat(sep='\n\n')

Brodeur, A., Mikola, D., & Cook, N. (2024). Mass Reproducibility and Replicability: A New Hope. IZA Discussion Paper.

Thériault, R., Ben-Shachar, MS., Patil, I., Lüdecke, D., Wiernik, BM., & Makowski, D. (2024). Check your outliers! An introduction to identifying statistical outliers in R with easystats. Behavior Research Methods. 56 (4), 4162-4172

Miglianico, M., Thériault, R., Lavoie, B., Labelle, P., Joussemet, M., Veilleux, M., & … (2024). Pratiques cliniques inspirées par la recherche en psychologie positive. Psychologie Française. 69 (1), 85-94

Ben-Shachar, MS., Patil, I., Thériault, R., Wiernik, BM., & Lüdecke, D. (2023). Phi, Fei, Fo, Fum: Effect Sizes for Categorical Data That Use the Chi-Squared Statistic. Mathematics. 11 (9), 1982

Thériault, R. (2023). rempsyc: Convenience functions for psychology. Journal of Open Source Software. 8 (87), 5466. https://doi.org/10.21105/joss.0546

Thériault, R., Dion-Cliche, F., & Dandeneau, S. (2023). Unmet Expectations: Social Inclusion and the Interaction Between Social Anxiety and Ambiguous or Positive Feedback. Frontiers in Psychology. 14, 1-10

Thériault, R., & Dandeneau, S. (2023). Implicitly Activating Mindfulness: Does Trait Self-Control Moderate Its Effect on Aggressive Behaviour?. Mindfulness. 1-17

Thériault, R. (2023). lavaanExtra: Convenience functions for package lavaan. Journal of Open Source Software. 8 (90), 5701

Thériault, R. (2023). L’importance de la science ouverte en recherche en psychologie. PsyArXiv.

Ben-Shachar, MS., Makowski, D., Lüdecke, D., Patil, I., Wiernik, BM., Thériault, R., & … (2022). effectsize: Indices of effect size. R package version. 0.8

Lüdecke, D., Ben-Shachar, MS., Patil, I., Wiernik, BM., Bacher, E., Thériault, R., & … (2022). Easystats: Framework for easy statistical modeling, visualization, and reporting [R package]. CRAN. **.

Thériault, R., Landry, M., & Raz, A. (2022). The Rubber Hand Illusion: Top-down attention modulates embodiment. Quarterly Journal of Experimental Psychology. 75 (11), 2129-2148

Thériault, R., Olson, JA., Krol, SA., & Raz, A. (2021). Body Swapping with a Black Person Boosts Empathy: Using Virtual Reality to Embody Another. Quarterly Journal of Experimental Psychology.

Makowski, D., Lüdecke, D., Patil, I., Thériault, R., Ben-Shachar, MS., & Wiernik, BM. (2021). report: Automated reporting of results and statistical models. R Package Version. 0.4. 0

Krol, SA., Thériault, R., Olson, JA., Raz, A., & Bartz, JA. (2020). Self-Concept Clarity and the Bodily Self: Malleability Across Modalities. Personality and Social Psychology Bulletin. 46 (5), 808-820

Lüdecke, D., Makowski, D., Ben-Shachar, MS., Patil, I., Waggoner, P., & … (2019). Performance: assessment of regression models performance. CRAN: Contributed Packages.

Thériault, R. (2019). Book Review: Awareness Is Freedom: The Adventure of Psychology and Spirituality. Frontiers in Psychology. 2814

Thériault, R., & Raz, A. (2018). Patterns of Bronchial Challenge Testing in Canada. Can J Respir Ther. 54 (2), 41-47

Lifshitz, M., Sheiner, EO., Olson, JA., Thériault, R., & Raz, A. (2017). On Suggestibility and Placebo: A Follow-Up Study. American Journal of Clinical Hypnosis. 59 (4), 385-392

Numbering format

format_publications("NrfwEncAAAAJ", "R Thériault") |> print(quote=FALSE)

[1] Brodeur, A., Mikola, D., & Cook, N. (2024). Mass Reproducibility and Replicability: A New Hope. IZA Discussion Paper.
[2] Thériault, R., Ben-Shachar, MS., Patil, I., Lüdecke, D., Wiernik, BM., & Makowski, D. (2024). Check your outliers! An introduction to identifying statistical outliers in R with easystats. Behavior Research Methods. 56 (4), 4162-4172 [3] Miglianico, M., Thériault, R., Lavoie, B., Labelle, P., Joussemet, M., Veilleux, M., & … (2024). Pratiques cliniques inspirées par la recherche en psychologie positive. Psychologie Française. 69 (1), 85-94
[4] Ben-Shachar, MS., Patil, I., Thériault, R., Wiernik, BM., & Lüdecke, D. (2023). Phi, Fei, Fo, Fum: Effect Sizes for Categorical Data That Use the Chi-Squared Statistic. Mathematics. 11 (9), 1982
[5] Thériault, R. (2023). rempsyc: Convenience functions for psychology. Journal of Open Source Software. 8 (87), 5466. https://doi.org/10.21105/joss.0546
[6] Thériault, R., Dion-Cliche, F., & Dandeneau, S. (2023). Unmet Expectations: Social Inclusion and the Interaction Between Social Anxiety and Ambiguous or Positive Feedback. Frontiers in Psychology. 14, 1-10
[7] Thériault, R., & Dandeneau, S. (2023). Implicitly Activating Mindfulness: Does Trait Self-Control Moderate Its Effect on Aggressive Behaviour?. Mindfulness. 1-17
[8] Thériault, R. (2023). lavaanExtra: Convenience functions for package lavaan. Journal of Open Source Software. 8 (90), 5701
[9] Thériault, R. (2023). L’importance de la science ouverte en recherche en psychologie. PsyArXiv.
[10] Ben-Shachar, MS., Makowski, D., Lüdecke, D., Patil, I., Wiernik, BM., Thériault, R., & … (2022). effectsize: Indices of effect size. R package version. 0.8
[11] Lüdecke, D., Ben-Shachar, MS., Patil, I., Wiernik, BM., Bacher, E., Thériault, R., & … (2022). Easystats: Framework for easy statistical modeling, visualization, and reporting [R package]. CRAN. .
[12]
Thériault, R., Landry, M., & Raz, A. (2022). The Rubber Hand Illusion: Top-down attention modulates embodiment. Quarterly Journal of Experimental Psychology. 75 (11), 2129-2148
[13]
Thériault, R., Olson, JA., Krol, SA., & Raz, A. (2021). Body Swapping with a Black Person Boosts Empathy: Using Virtual Reality to Embody Another. Quarterly Journal of Experimental Psychology.
[14] Makowski, D., Lüdecke, D., Patil, I.,
Thériault, R., Ben-Shachar, MS., & Wiernik, BM. (2021). report: Automated reporting of results and statistical models. R Package Version. 0.4. 0
[15] Krol, SA.,
Thériault, R., Olson, JA., Raz, A., & Bartz, JA. (2020). Self-Concept Clarity and the Bodily Self: Malleability Across Modalities. Personality and Social Psychology Bulletin. 46 (5), 808-820
[16] Lüdecke, D., Makowski, D., Ben-Shachar, MS., Patil, I., Waggoner, P., & … (2019). Performance: assessment of regression models performance. CRAN: Contributed Packages.
[17]
Thériault, R.** (2019). Book Review: Awareness Is Freedom: The Adventure of Psychology and Spirituality. Frontiers in Psychology. 2814
[18] Thériault, R., & Raz, A. (2018). Patterns of Bronchial Challenge Testing in Canada. Can J Respir Ther. 54 (2), 41-47
[19] Lifshitz, M., Sheiner, EO., Olson, JA., Thériault, R., & Raz, A. (2017). On Suggestibility and Placebo: A Follow-Up Study. American Journal of Clinical Hypnosis. 59 (4), 385-392