PEER Network School 2023
Welcome to this PEER Advanced Network School. First, thanks to Scott Franklin, Ellie Sayre, Mary Bridget Kustusch and the PEER team for supporting this field school!
The main idea is to have some time learning about different aspects of Social Network Analysis, and to actually focus on doing the analyses. We will spend the mornings working on learning aspects of SNA, and then the afternoons will be working time, where we actually try to carry out some of the types of analyses on your own data.
|Network Theory and Data||26 June 2023|
|Individuals within networks||27 June 2023|
|Properties of whole networks||28 June 2023|
|Network models||29 June 2023|
|Projects||30 June 2023|
What might you learn?
There are lots of things that I hope you would learn: to look at the complexity of Social Networks, to use R tools to look at networks, to read code, to write code, to find that you are not afraid (and actually like) to analyze data with R. By the end of all three sessions here are things I think you should be able to do after participating in each of the workshops:
- Import Data into R
- Clean data and prepare data for SNA
- Organize data for SNA
- Vizualize data
- Understand different centrality metrics
- Plotting networks and incorporating centrality
- Understand different metrics about whole networks
- Community Detection
- Side trip to visit bipartite networks.
- Understand the framework for comparing networks
- Test hypotheses in a network framework
What is beyond the scope?
While I would love to say you’d be able to master doing network analysis in R in the short span of 1 week, you will not. I’ve spent a decade learning it and am proficient, but am constantly learning.
Similarly, R is a tool. You are the expert in your research, R can help you to uncover elements of your data. I don’t know how to analyze your data - learning R will empower you to analyze your data.
I began learning R because I wanted to have something that worked across different platforms. But then I realized that the Open Source movement was something that was particularly interesting to me. But those aren’t the reason. I have come to see R as crucial for reproducibility. If I write a set of code to analyze a dataset, I can re-run this code, and get the same results. I can share the code with my publications, so someone else can check my work. This is critical to doing science. R allows me to do this in a way that Excel doesn’t.
Certainly other languages (Python, Julia, etc…) will also allow for this, but why I have stuck with R is that there is a rich community that are supportive of learning and growing.
Open source things tend to be a bit tribal - within R one debate is Base R vs. Tidyverse. This is a debate about how to teach R, and I have decided to use the Tidyverse. It might not ever matter to you, but I want you to know that I have thought about this, and my answer is to use Tidyverse for the following reasons:
- Readability - Code should be readable by humans
- Coherence with my practice - I generally try to use the Tidyverse approach in my work, so I feel like I can best help by using these tools
- Coherence with themselves - These tools are organized in a way that helps them to be internally coherent
- Support - There are loads of books and websites that are published open-source, and so you can typically find the help you need.
Resources for Learning R
Learning R is not easy, as I have learned things, I have tried to maintain a list of the different resources that I have drawn on.
- Install R and RStudio
- Introduction to R (using Base R) - R Programming for Data Science
- My all time favorite book for learning statistics and R is Learning Statistics with R
- Danielle Navarro also has a course Robust Tools that gives a sense about the breadth of the tools that R provides.
- These two guides, The R Guide and R for Beginners
A bit more advanced
- Hadley Wickham’s book, R For Data Science
- Project Oriented Workflows
- WTF R Jenny Bryan is great at explaining things about R and so frequently, I find myself looking at her books.
- Cheat Sheets I can never remember all the commands.
- Happy Git With R Eventually you’ll need to use Git, and this is essential reading for making that transition.
- RMarkdown Markdown allows you to use reproducible code and integrate R into reporting and writing. This is how I do all my work.
- DataCarpentry runs workshops about data analysis and often are oriented to different research specialties.
- The Effect A really great book that approaches data analysis from a causal inference perspective with code in R, Python, and Stata.
Resources for Learning SNA
- Wasserman and Faust The seminal text on social networks from the sociological camp. No help with R, but includes so much that it is an essential reference.
- Newman Don’t know if I would call it the seminal text on networks from the graph-theoretic camp. No help with R. If you want to have access to the mathematical underpinnigs, this one has it (probably).
- Barabasi A newer text, it is available to read online, with some animations, cool visuals, etc. Focus on python I think.
- Sage Handbook. A comprehensive guide to doing SNA - has chapters on IRB, network generators, analytic approaches and more. Plus if you want to read about the two camps from a sociologist’s perspective, chapter four is not particularly kind to physicists.