Actions

Complex Systems Summer School 2017-Projects & Working Groups

From Santa Fe Institute Events Wiki

Revision as of 16:48, 15 June 2017 by Littlecharlotte (talk | contribs) (→‎Interested Participants)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Complex Systems Summer School 2017

Evolution Working Group/Microbial Systems are Complex

Summary

Several of us seem to have shared interests in microbial ecology/evolution/genetics and/or evolutionary ecology/evolutionary theory in general. It might be a good idea to meet up and brainstorm a project (maybe over beer?). Some potential ideas that were discussed during speed dating:

  • The evolution of the bacterial pangenome
  • Development/adaptation of classical population genetic techniques to microbes (maybe via a network describing HGT interactions)
  • The evolution of metabolism/the interaction between metabolism and community structure
  • How interspecific interactions (e.g. crossfeeding, competition) influence the rate of decay of community similarity in a homogeneous environment (scale of spatial organization)


Interested Participants

IF INTERESTED PLEASE JOIN SLACK GROUP evolution_microbes (Will not be emailing people on this list as it grows)

  • Jake Weissman (jw4336(at)umd.edu)
  • Makoto Jones (makoto.jones(at)hsc.utah.edu)
  • Hilje Doekes (h.m.doekes(at)uu.nl)
  • Basak Taraktas (basak.taraktas@northwestern.edu)
  • Alicia Kraay (amullis (at) umich.edu)
  • Ximo Pechuan joaquin.pechuanjorgd@phd.einstein.yu.edu
  • Jiří Moravec
  • Yael Gurevich (yael01[at]gmail.com)

EU: responding to migrant crisis

Updated: NEXT MEET-UP: Thursday (15jun) TODAY 4.15pm Great Hall

Keywords

cooperation, migration, consensus-reaching, supranational unions

Summary

How does an organization of independent countries, such the European Union, deal with a migrant crisis? How can a consensus on a response be reached? Why did the EU fail to deal efficiently and fairly with the 2015 migrant crisis (>1m people)? Why isn’t the migrant quota system not working? (Or is it?)

Yes, these are all pretty big questions, but I believe very exciting and timely. Haven’t figured out details obviously, but I think these could be explored with agent based models (possibly NetLogo?). It would be particularly interesting to see how a few factors influence the chances that a consensus is reached and what type of consensus:

  • country on coast/external border vs landlocked
  • how far from coastal border
  • economic power
  • diplomatic power
  • Union organization: continuum - federal government can impose policies —> unanimous consensus must be reached every time

If you find this interesting, include your name and email on the list below & let’s meet up and discuss!

Resources

Interested Participants

  • Alberto Micheletti (biology, social evolution, ajcm2[at]st-andrews[dot]ac[dot]uk) - group contact
  • Andrew Johnson (biology, ecology, competition, collapse, afjohnson[at]ucsd[dot]edu)
  • Ella Jamsin (sustainability, design, tipping points, e[dot]jamsin[at]tudelft[dot]nl)
  • Madison Hart (psychology, social evolution, collapse, madisonkhart(at)gmail.com)
  • Basak Taraktas (basak.taraktas@northwestern.edu)
  • Martina Balestra (management, human computer interaction, mb5758[at]nyu[dot]edu)
  • Hilje Doekes (theoretical biology, evolution, microbial cooperation and spite, h.m.doekes (at) uu.nl)

Gut-Brain Axis: Impact of gut microbiota on mental health

BRAINSTORMING MEETING AT 12PM OVER LUNCH on 6/14 (WEDNESDAY=TODAY). MEET IN CAFETERIA. Email Bleu (happylittleaccident AT gmail) with questions

Keywords

public health, microbiome, metabolism, network analysis, neuroscience

Summary

The term "gut feeling" is not without a scientific basis. Recent literature has emphasized the connection between the brain and intestinal microbes. Studies are beginning to link neurodegenerative disorders, such as Alzheimer's disease and multiple sclerosis, to gut dysbioses.

Other relationships between the brain and the gut are waiting to be explored. Also, on a broader level, relationships between microbial metabolism and neurotransmitter levels could be investigated.

Open source data sets are available, such as the ones mentioned in this paper: bmcsystbiol.biomedcentral.com/articles/10.1186/s12918-016-0307-y

If you find this interesting, include your name and email on the list below

Interested Participants

  • Bleu Knight (bleu AT nmsu.edu; happylittleaccident AT gmail.com)
  • Aaron Schwartz (aschwa15 AT uvm.edu)
  • Madison Hart (madisonkhart(at)gmail.com)
  • Deepa Rao [drao@mit.edu]
  • Rachel Gicquelais (rgic (at) umich.edu)
  • Ximo Pechuan [joaquin.pechuanjorge@phd.einstein.yu.edu]
  • Kyle Reing (reing@usc.edu)

Beer: evolution through the ages

BRAINSTORMING MEETING AT 5PM OVER DINNER on 6/14 (WEDNESDAY=TODAY). MEET IN CAFETERIA. Email Bleu (happylittleaccident AT gmail) with questions

Keywords

beer, evolution, genetic elements, history

Summary

Brewing beer is an ancient art that has experienced a renaissance in the last few decades. Can we model beer recipes in an "evolutionary" sense? Can ingredients like hops and malt be modeled like genes that are expressed more in some beers than in others? Can we create phylogenetic trees that delineate relationships between beers? Can beers be considered 'species', and if so, what has contributed to the diversity that we see in the US today?

If you find this interesting, include your name and email on the list below

Resources

Media:5000_yr_old_Chinese_beer_recipe.pdf
Media:Early_Iron_Age_and_Late_Mediaeval_malt.pdf
beer recipe database: beerrecipes.org
medieval english ales: cs.cmu.edu/~pwp/tofi/medieval_english_ale.html
about food (not beer): Flavor network and the principles of food pairing DP: I think it is a great reference to start with!

Interested Participants

Bleu Knight (bleu AT nmsu.edu; happylittleaccident AT gmail.com)

  • Doheum Park (park154 AT kaist.ac.kr, innovation, network science, data science)

International Trade Patterns in Local Fisheries

Keywords: cascading effects, overfishing, international seafood trade, exploitation patterns

Summary

Seafood is an increasingly globalized food commodity with now 40% traded internationally. At the same time, the majority of global fish stocks are being fully- (58.1%) or over-exploited (31.4%). International buyers connect to local fisheries as the fishery is overexploited the buyer moves on to the next location. This pattern of serial exploitation has been observed in a fisheries worldwide such as tuna, cod, sea cucumber. We aim to explore the linkages and changes in bilateral trade networks that happen before and after a fish stock collapse to find different patterns of exploitation. A starting point for the analysis may be studying anomalies and changes in the global network of seafood trade.

Include your name and email on the list below to talk more :)

Meeting Lunch Thursday June 15. Anybody interested in brainstorming, join!

Resources UN Comtrade publishes freely available bilateral trade data: https://comtrade.un.org/

RAM legacy is a database which contains marine stock assessments (collapses): http://ramlegacy.org/

Participants

Maartje Oostdijk (group contact, maartjeoostdijk@gmail.com)
Laura Elsler (group contact)
Basak Taraktas (basak.taraktas@northwestern.edu)
Junfu Zhao (zjf18810688936@gmail.com)
Elaine Bochniewicz (emb@mitre.org)
Ximo Pechuan joaquin.pechuanjorge@phd.einstein.yu.edu
Anand Nair (anandnair1@gmail.com)

Agent-Based Prediction Competition

Report of the meeting of Wednesday 14/06: https://drive.google.com/file/d/0B0mKa7Rl4z1LS3lCN190QVNOOXM/view?usp=sharing

Next meeting: Thursday 15/06 over lunch or at 16.15 (to be decided)

Keywords

Agent-Based Models, Machine Learning, Prediction

Summary

Agent-Based Models (ABMs) have so far been used mainly for descriptive purposes. Some people (including myself) think that ABMs can unleash their full potential in making predictions. I envision this project as an open-ended test to this hypothesis. The main idea is borrowed from the machine learning prediction competitions (www.kaggle.com), in which you are given a training dataset and have to outperform the other participants in predicting the outputs of a test set. Instead of using a machine learning algorithm to approximate a target function, I propose we use an ABM. This project can take several directions:

  • The topic of the ABM and the data can come from any discipline: biology, epidemiology, economics, sociology, ecology, etc.
  • The data can be real or synthetic. In the latter case, someone could code a very complicated ABM as the Data Generating Process of a fictitious system, and the task would be to approximate this complicated ABM.
  • A humble goal would be to take machine learning as an upper bound for prediction possibilities (https://site.stanford.edu/sites/default/files/submission_kleinbergliangmullainathan.pdf), and see how putting realistic assumptions in the ABM makes the prediction score close to the machine learning benchmark.
  • A more ambitious goal would be to beat the machine learning algorithms. We know they can fail because the world is non-stationary (see Google Flu Trends failure). ABMs can account for that.
  • Any other direction!

I would be happy to discuss about any practical example with whoever is interested. I think this project can be very risky, but this is why I’m proposing it here! Still, at least a proof of principle would be a great result. I'm open to any hijacking of this project however!

Group Contact

Marco Pangallo (complexity economics, marco[dot]pangallo[at]maths[dot]ox[dot]ac[dot]uk)

Interested Participants

  • Meghan Galiardi (meghangaliardi(at)gmail.com)
  • Elliot Nelson (enelson(at)pitp.ca)
  • Madison Hart (madisonkhart(at)gmail.com)
  • Martina Balestra (mb5758[at]nyu[dot]edu)
  • Uzay Cetin (uzay00(at)gmail.com)
  • Emily S (eshouppe [at] gmail [dot] com)
  • Cigdem Yalcin (cigdem_yalcin(at)yahoo.com)
  • Chris Miles (cmiless@umich.edu)
  • Andrew Johnson (afjohnson[at]ucsd.edu)
  • Stefan B (sfb311[at]nyu[dot]edu)
  • Yuji Saikai (saikai(at)wisc.edu)
  • Yael Gurevich (yael01[at]gmail.com)

AI-aided Graph Auto-generation

Keywords

Graph Auto-generation; Graph Characteristics; Artificial Intelligence; Combinatorial Optimization;

Summary

Complex networks/graphs appear in almost every aspect of science and technology, ranging from social systems, computer networks, biological networks to the state spaces of physical systems, even Bayesian Belief Networks. Graph auto-generation has wide applications. Just like what is done by the Google AlphaGo, this oftentimes ends up as a combinatorial optimization problem. The prohibitive number of possibilities calls upon the Artificial Intelligence to determine better strategic choices at every turn. There are several interesting topics.

  • As a starter, we would like to determine the topological structure of given networked systems. A variety of basic measures and metrics are available that can tell us about small-scale structure in networks, such as correlations, connections and recurrent patterns, but it is considerably more difficult to quantify structure on medium and large scales which can help us understand the ‘big picture’.
  • Next, a fundamental property of interdependent networks is that failure of nodes in one network may lead to failure of dependent nodes in other networks. This may happen recursively and can lead to a cascade of failures. In fact, a failure of a very small fraction of nodes in one network may lead to the complete fragmentation of a system of several interdependent networks.
  • Thirdly, assume we have a collection of node and a defined dependency model. Can we auto-assemble these node to form a network which satisfies performance requirements. The dependency model can be defined in the following way
   1. Each node has a defined interface, which specifies what kind of supplies it needs and what kind output it can produce.
   2. The output of a given node can be computed based on its chosen suppliers and available supply levels.
  • Finally, given a set of nodes, assuming the edge length between any pair of nodes is computable, can we find a shortest path between two given nodes such that the path consists of edges with bounded length. Moreover, given multiple measures of the edge length, can we find a Pareto front of shortest paths? In other words, a multi-objective optimization of such paths.

Group Contact

Huang Tang (MITRE, Co.)

Interested Participants

  • Sheri Xue Guo (xg8@st-andrews.ac.uk, computer science)
  • Junfu Zhao (zjf18810688936@gmail.com)

Shared rituals, memes and group identity

Keywords

meme, rituals, group identity

Summary

Shared language or rituals are important for identification of group members from non-members, which itself is important for group cohesion and group identity. In pre-state societies, where survival was dependent on intragroup collaboration, this was of high importance. In modern culture, mere survival is often guaranteed by states, but group identity is still important, whether in meatspace or in cyberspace.

This project will try to identify origin and spread of internet memes, as a form of internet rituals, their importance for group cohesion and their dead, i.e., when popular memes spread into different communities and became common in internet, while losing their original function.

Group Contact

Jiří Moravec j.moravec [at] massey.ac.nz

Join #memes channel on slack

Resources

I think this website is going to be of great help: http://knowyourmeme.com/

Interested Participants

  • shanee stopnitzky entropy@ucsc.edu - i was going to propose the exact same project! awesome!
  • aida.huerta@comunidad.unam.mx
  • Rémi Lamarque - remi.lamarque@lpl-aix.fr

Multi-Layer Networks

Keywords

networks, ecology, multilayer networks, interactions

Summary

Networks are a powerful approach to studying ecological systems; however, they typically represent a "monolayer network" of interactions in a snapshot of time. This project is interested in using a "multilayer network" approach to include more complexity in ecological networks. With a multilayer network, we can represent the same ecosystem using layers that describe the system either with several interaction types (competition, cooperation, predation, etc.), communities of species, or over time (time-series data on occurrences, for example). By using interlayer edges, we can connect the monolayers of the network.

The paper that describes this approach includes a table with ~10 available datasets (Supplementary Table 1) and leads to packages in Python, R, and Matlab (Supplementary Table 3).

Some questions include:

  • Using temporal variation in the networks -- what are the dynamics of network assembly?
  • How do different interaction types affect network stability?
  • How do interactions between food webs (via common species) affect the dynamics of each food webs separately?
  • How do processes at one level affect those in other levels?
  • How do diseases spread in interconnected populations of different hosts?
    • UPDATE** After some discussions today, it seems like there is also a lot of overlap with non-linear time series analysis. It could be possible to find a temporal dataset and do both a multi-layer network analysis and non-linear time series (using rEDM).

Resources: [1]!

Interested Participants

  • Deepa [drao@mit.edu]
  • Basak Taraktas (basak.taraktas@northwestern.edu)
  • Bleu Knight [happylittleaccident AT gmail]
  • Shing Zhan [shing.zhan@gmail.com]
  • Surendra Hazarie [shazarie@u.rochester.edu]
  • Martina Balestra [mb5758[at]nyu[dot]edu]
  • Anand Nair [anandnair1@gmail.com]
  • Ximo Pechuan [joaquin.pechuanjorge@phd.einstein.yu.edu]
  • Kayla Sale [kaylasale@email.arizona.edu]
  • Huang Tang

Crook's hyperensembles to improve Maximum Entropy Theory of Ecology (METE) and Max Ent in other non-equilibrium systems

Summary

METE is a promising approach to detecting macroecological patterns like species abundance and size from relatively small amounts of data, but fails to accurately predict interactive phenomena like size-density relationships. The non-equilibrium nature of open ecological systems is a likely constraint for using max ent for ecological applications. These may be better resolved using Crook's hyperensemble method, which is basically taking the maximum entropy of the maximum entropy of the distribution.

This project is simple, can be applied to many different types of 'ecological systems', data is available and abundant, and we should be able to reach a publishable conclusion during the program.

Interested Participants

  • Shanee Stopnitzky entropy@ucsc.edu
  • Marco Pangallo marco.pangallo@maths.ox.ac.uk
  • Alje van Dam aljevdam@gmail.com
  • Tim Kunisky kunisky@cims.nyu.edu
  • Elliot Nelson enelson@pitp.ca
  • Chris Miles cmiless@umich.edu
  • Deepa Rao drao@mit.edu
  • Ximo Pechuan joaquin.pechuanjorge@phd.einstein.yu.edu
  • Markus Junginger
  • Kyle Reing reing@usc.edu

Cryptocurrency market predictions

Summary

Can we accurately predict cryptocurrency markets using nonlinear time series analysis? Does social network 'popularity' improve predictions?

Interested Participants

  • Shanee Stopnitzky entropy@ucsc.edu
  • Abdel Abdelgabar (yomegaland AT gmail)

Complex phenomena in cetacean (whale and dolphin) communication

Summary

Click sounds produced by whales and dolphins for echolocation and communication are highly complex and possess a fractal structure. Are there discernible patterns in these sounds that could help us decode how these animals perceive the world, communicate with each other and other species (like us?) This needs more thought, but lots of sound data exists.

Interested Participants

  • Shanee Stopnitzky entropy@ucsc.edu
  • Elaine Bochniewicz emb@mitre.org
  • Kayla Sale kaylasale@email.arizona.edu

Human Movement Dataset

Summary

Dataset looking for a research question. Data is a time series of Linear and angular X, Y, Z of the upper limb of 25 different people completing a series of activities.

  • One possible thought, is there a way to recognize the difference between the way a control (normal) moves versus a stroke survivor?
  • Is there a way to identify the severity of an injury based upon movement patterns?
  • Can specific movement patterns be identified consistently?

Interested Participants

  • Elaine Bochniewicz emb@mitre.org


From Dictatorship to Anarchism (and everything in between)

Keywords

Politics, social networks, agent based modeling, network theory, network analysis

Summary

Study different types of socio-political organizations through network theory and agent based models.
Potential questions: - What kind of structures emerge as a function of, size, agents characteristics such as wealth/power distribution, etc.?
- How does different social structures contribute affect the output of different quantities such as production, self-fulfillment, ... ?
- How does the system evolve when a leader is taken down?
- Does leadership emerge?
- ...

If you find this interesting, include your name and email on the list below


To Do

By Friday, June 15

- Think about how to characterize our agents and the interactions.
- Literature review: every person reads 1 paper/topic that will be presented on Friday.
- Think about what political regimes we want to study and how to describe them.
- Is there data we could use to inform our model-building or to validate our simulation results?


Resources

ABM CODES:

 MASON (Java) -- http://cs.gmu.edu/~eclab/projects/mason/ 
RNetLogo (R) -- https://cran.r-project.org/web/packages/RNetLogo/index.html
Mesa (Python) -- https://pypi.python.org/pypi/Mesa/
Swarm (Java and Objective C )-- http://www.swarm.org/wiki/Swarm_main_page

READINGS

- Theory:
1. Veblen, T. (1934). “The Beginnings of Ownership” in his Essays in Our Changing Times.
2. Field, B. (1989). “The Evolution of Property Rights,” Kyklos 42: 319-45.
3. Foley, D. - "Socialist alternatives: from Vienna to Santa Fe", https://docs.google.com/a/newschool.edu/viewer?a=v&pid=sites&srcid=bmV3c2Nob29sLmVkdXxkdW5jYW4tZm9sZXktaG9tZXBhZ2V8Z3g6NTc3YTQyYzJkZTBiYzk2MA


- Modeling:
1. Epstein and Axell, "Growing Artificial Societies- Social Science from the Bottom Up"
2. (Hugo) Epstein, J. - "Why Model?", Journal of Artificial Societies and Social Simulation vol. 11, no. 4 12, http://jasss.soc.surrey.ac.uk/11/4/12.html
3. (Hugo) Bartha, R. , Meyerb, M. and Spitznerc, J. , "Typical Pitfalls of Simulation Modeling - Lessons Learned from Armed Forces and Business ", http://jasss.soc.surrey.ac.uk/15/2/5.html



- Networks:
1. Watts, D. and Dodds, P. - "Influentials, Networks, and Public Opinion Formation", (2007), https://academic.oup.com/jcr/article-abstract/34/4/441/1820236/Influentials-Networks-and-Public-Opinion-Formation

Interested Participants

Marjan Fadavi (Economist)
Rachel Gicquelais (Epidemiologist)
Jose Coronado (Economist)
Freya Casier (Economist)
Adrian Soto (Physicist) -- adrian.soto-cambres_aatt_stonybrook.edu
Martina Balestra (Management...Scientist?)
Madison Hart (Social Sciences) (madisonkhart(at)gmail.com)
Junfu Zhao (political economy)
Carlos Viniegra (Economics)
Ella Jamsin (Physics, Sustainability)
Hugo Serrano (Computer Science)
Valérie Reijers (Ecology)
Damon Frezza (Operations Research)
Abdel Abdelgabar (Medicine,Neuroscience,Economics)
Markus Junginger
Carla Rivera
Mark Kirstein
You?

Some datasets that might trigger an idea

DataUSA

bunch of USA data (towns, cities, states, education, skills, occupations, industries...)

https://datausa.io

Pantheon

historical database of 'globally known people' with place and date of birth and death (not sure if it's publicly available though..)

http://pantheon.media.mit.edu


Ethnographic Atlas

  • database of 1167 societies and their ethnographic descriptors (see here )
  • Corrected ethnographic atlas: here
  • some links might be dead, so here is reupload of R version: here

Publicly Available Time Series Data

https://datamarket.com/data/list/?q=provider:tsdl


Bipartite Ecological Networks

Summary

Characterizing and constructing bipartite ecological networks (using machine learning or something else?) What are the differences in structure between networks of different interactions? How do nonbiological factors inform network shape (weather, location etc.)? How do they stack into a multiplex network? Can we devise a way to create realistic networks? Given a bipartite network can we determine which type of interaction they are composed of? Can we categorize bipartite networks from other fields (like economics, physics?) as most like one of these ecological networks and then use insights for both field?

Networks, metadata, and references available at web-of-life.es Mutualistic

  • 34 plant-seed disperser
  • 143 plant-pollinator

Antagonistic

  • 4 plant-herbivore
  • 51 host-parasite

You can email me at (kaylasale(at)email.arizona.edu) to chat about ideas, feasibility, whatever!

Interested Participants

  • Alje van Dam aljvdam@gmail.com (thought a bit about country-product, occupation-city networks and their applications)
  • Surendra Hazarie : shazarie@u.rochester.edu
  • Deepa Rao [drao@mit.edu]
  • Tom Logan tomlogan@umich.edu
  • Anand Nair anandnair1@gmail.com
  • Ximo Pechuan joaquin.pechuanjorge@phd.einstein.yu.edu

Self-organization of cities

keywords

spatial patterns, self-organization, power-laws, traveling salesman

Summary

This project aims to look for patterns in the spatial arrangement of human settlements. Previously it has been shown that the size of US cities follows a power-law distribution, but is this also the case for other countries? If we for example look at the size and spatial arrangement of cities in the Netherlands the pattern seems to be quite different. In this project it would be cool to look at the spatial organization of cities.

Possible research question: - Are there differences between countries in terms of their spatial arrangement (more clustered vs more dispersed?) How does this relate to the traveling distance between cities? Does the traveling distance follow for example a power-law? How does this interact/correlates with the size of cities. - Is the spatial arrangement dependent on the environment? For example if a countries has less favorable places (heterogeneous environment) do we also observe more clustering (adaptive strategy)? How does the interaction between facilitation (more resources if together) versus competition (disease outbreaks etc) influence the size and spatial arrangement of cities (and how is this influenced by the environment)? - Can we observe differences between the time the areas were colonized? (Australia and US as immigrant countries vs european countries?). Are cities in 'older' countries more likely to be dispersed (means of transport at time of settlement influences the distance between cities). Older cities more likely to have 'memory/legacy' effect of competition?

If you are interested let's get together and meet me or Tom or email: valeriereijers(at)gmail(dot)com.


Interested Participants

  • Tom Logan tomlogan@umich.edu
  • Alje van Dam aljevdam@gmail.com

Delayed Coordinate Mapping and Predicting Epidemics (or other time series)

Summary

The goal of this project will be to learn how to use coordinate mapping and other dynamic nonlinear models for predicting time series data. Many people have had some interesting ideas like (please feel free to add your own it's not here):

- How do predictions from a mechanistic model compare with those from a dynamic model and which is better?
- Similar to above, generate a null model of noise that resembles the data, and see if coordinate mapping claims to infer something from this vs. the real data (Josh tells me there's some literature about this he could point us to).
- Evaluating the type of disease dynamics these methods could be used on (e.g. seasonal epidemics, persistent infections, sporadic ones, etc).
- Can you use these methods to predict disease epidemics at all, and what are their limits?
- Conceptual/theoretical question: does the relative performance of nonlinear time series analysis and conventional epidemic models depend on collinearity between predictors and the role of unmeasured confounders

Reading

This paper is a good place to start our discussion: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0169050

Lunch Meeting

Let's meet today (June 14th) during lunch to discuss project ideas!

Interested Participants

  • Spencer Fox (spncrfx@gmail.com)
  • Bleu Knight (happylittleaccident AT gmail)
  • Tim Kunisky (kunisky@cims.nyu.edu)
  • Shing Zhan (shing.zhan@gmail.com)
  • Rachel Gicquelais (rgic (at) umich.edu)
  • Hilje Doekes (h.m.doekes (at) uu.nl)
  • jake weissman Ijw4336(at)umd.edu
  • Ximo Pechuan joaquin.pechuanjorge@phd.einstein.yu.edu
  • Alicia Kraay (amullis (at) umich.edu)
  • Shanee Stopnitzky entropy@ucsc.edu
  • Elaine Bochniewicz emb@mitre.org

“Can complexity science save the world?”: a one-hour journey through open-minded discussions

Summary

“Can complexity science save the world?” At dinner today, Carlos Viniegra proposed this question jokingly, mainly to mock my (Yao's) oversized project questions.

But hey, why not start with this rather silly question, take a one-hour journey of open-minded discussions, and see where we will end up? One can certainly ask this question to any number of scientists, and have a ton of fun discussions. But we all know that the whole is greater than the sum of its parts, so the discussion ought to be even more fun as a group activity!

Come and share your deep concerns of the world and humankind, the “Holy Grail” questions of your research, the annoying limitations of your methodology (the ones that got swept under a rug)? Maybe through the diverse real-world big questions, theoretical questions, and methodological questions, something interesting will emerge, and innovative solutions can follow. And here is the bottom line: what other chances are there to hear about ideas/biases/rants from such a diverse group :) ?

Plan

Time: Some hour later in this week. Perhaps Friday?
Discussion moderator: Carlos will moderate! He will apply moderation procedures to stimulate innovative ideas.
Bring: Your rants and laughters, and maybe your world-saving superpower (optional, although JP said we all have it)

Interested Participants

  • Yao Liu
  • Madison Hart
  • Alex Jurgens
  • Andres Munoz

The Formation and Evolution of Religious Communities

keywords

network analysis, agent based modeling, social networks, citation networks

Summary

In reviewing a century of anthropological data, Stark (1996) estimates that only 1 new religious group in 1,000 will last for longer than a century and gain significant membership. The success of these groups can be measured as a function of their ability differentiate from competitors in the religious marketplace, identify and elevate leaders or spokespersons, and in recruit new membership at a rate that ensures the replacement of lost membership (defection, death, etc.). This project proposes to study these aspects of the formation and evolution of new religious communities in relation to a religious movement known as Sufism, over a period of approximately 150-200 years. Data relates to the topology of the network network data related to cliques of Sufis, their most prominent spokespersons, and the rates at which these spokespersons' messages disseminated through the network. It is hoped that a preliminary agent-based model can be developed to predict significant network properties.

If you are interested please HMU: jeremy.farrell (at) emory (dot) edu.

Please add your name below if you'd like to get a piece of the action. 1 Uzay Cetin (uzay00(at)gmail.com)
2 Makoto
3 Valerie
4

From Makoto- I have an idea that relates to this, I think, that comes from the realm of antibiotic stewardship. I would be interested in doing some analysis of religious networks because I think at some point I could apply to my field. The antibiotic stewardship idea is that bacterial genes and human memes travel along related but not identical networks. In order for human memes to not die out, presumably they need to be reinforced to build critical mass. If some practice becomes monolithic and rigid then it can lead to collapse as soon as a gene for bacterial resistance arises that can move across the network unimpeded. Dominant antibiotic-resistant bacterial strains are similarly vulnerable to an attack. So in a religious context, I would be interested in what predicts branching in terms of adversarial competition. Maybe like a network implementation of Conway's game of life.

“Memory” in biological, ecological, social, and cultural systems

Summary

“Memory”, or legacy effects, are prevalent and significant in many worlds. Some examples in different systems include:

  • Vegetation that previous experienced drought might respond to water limitation differently.
  • Archeological sites (indicators of human activities in the past) are not always found in the most logical or optimal locations. Same goes for modern city developments.
  • Social practices (e.g. agricultural practices) are often driven by legacy, not optimal productivity or quality.

Potential research questions

  • Are there general ways to quantify characteristics of memory (e.g. the length and strength) across a diverse set of systems (biological, ecological, social, cultural, etc.)? Perhaps statistical, machine-learning approaches?
  • When we take memory into consideration, do we improve our prediction of the system’s behavior?
  • Sometimes, the “initial condition” of a system is unknown. Can we infer a system’s history from its subsequent behavior?
  • Under different regimes of perturbation, do memory help or hinder the function (survival, resistance, resilience, stability, etc.) of an organism, a community, a practice, a culture, etc.? The answer is likely different for a system versus for individual entities within that system.
  • What is the role of memory in its diverse forms (be it adaptation to the environment, learning from past experiences, or continuing/updated utilization of past infrastructures)? Can we view memory as a fundamental trait, on which natural selection occurs via evolutionary processes (in a broad sense)?

Interested Participants

  • Yao Liu
  • Jake Weissman (jw4336(at)umd.edu
  • Alex Jurgens
  • Marco Pangallo
  • Yael Gurevich
  • Shanee Stopnitzky

Cross-cultural Wikipedia Analysis

Keywords

Wikipedia, cross-cultural, network analysis

Summary

Wikipedia is made up of user-generated content from people around the world. There are versions in 285 languages. Assume that users generate content in their primary language and that a person’s primary language has geographic or cultural correlations. Under that assumption, one might be able to tease out differences in cultural perceptions of an event/topic/etc. based on what is said, excluded, or linked to on the page. If you find this interesting, include your name below.

Interested Participants

  • Kyle Lemoi
  • Romona Roler
  • Hilje Doekes
  • Alex Jurgens
  • Sean Wu
  • Elaine Bochniewicz
  • Madison Hart
  • Jiří Moravec
  • Rémi Lamarque

Movie Data Analysis

Keywords

Movies, modeling, complexity measures, generation and prediction

Summary

I was thinking about how people who study film often talk about "film language". Natural language is well studied (f not well understood) complex system, so I found myself wondering if we could apply some of the same methods to film. I found this database called Cinemetrics that has a large set of shot length per shot data, as well as all sorts of other data like shot framing, camera tilt, ect. I personally am interested in trying to model shot length data as a process and looking at entropy rate and statistical complexity as signatures of the data, potentially with the goal of being able to classify movies (action versus documentary, or by director, ect). There's also some interesting previous statistical work on the site that poses questions about psychology of editing, trend lines over time, and meaningful ways to find similar films. I don't think this has to be a huge project, but it seems like it could be a fun way to compare and contrast analysis skill sets across disciplines.

http://www.cinemetrics.lv/index.php

If you're interested please let me know! (Especially if you have experience in writing web scrapers.)
amjurgens AT ucdavis.edu
ajurgens on slack

Interested Participants

Alex Jurgens
Yao Liu
Chris Miles

Autocatalytic networks

Growth, innovation, booms/busts, creative destruction

Still very vague idea; for people interested:
Pnas paper: http://www.pnas.org/content/99/4/2055.full.pdf
Interpretation as model of innovation: http://people.du.ac.in/~jain/Innovation.pdf


Player Behaviour and Team Dynamics in a Competitive Online Environment

Summary

MOBAs, or multiplayer online battle arenas, are online video games that bring together random players to duke it out. These games generate tons of event data that can be utilized to test hypotheses about individual and organizational behaviour. There are outcome data that indicate victory or defeat and the performance characteristics of each player at the end of a game; there are also time series data capturing the behaviour and performance (e.g., gold accumulated and objectives accomplished) of each player throughout a game. For this project, I am proposing to analyze data from League of Legends (LoL) by Riot Games (http://play.na.leagueoflegends.com/en_US). LoL is a real-time strategy game whose classic mode involves two teams of five players. Riot Games boasts 100 million monthly players, with millions of games played daily around the world.

Riot Games has made some of the data available via its API (https://developer.riotgames.com/). Using this awesome data, we can ask questions about player behaviour and team dynamics. What kinds of players are more successful? Passive or aggressive in terms of play style (e.g., calculated or yolo)? Jack of all trades or master of one (there are many different types of characters, e.g., marksmen, melee fighters, and healers)? Leaders who call good shots or those who comfort team members that are getting rekt? What makes a good team? A leader followed by supporters, or no single leader? What happens if there are two alpha-types in a team? If there are downers or douchebags who refuse to cooperate, toxic as heck, and/or intentionally do so bad that it is practically 4 versus 6? There is some recent literature involving the analysis of LoL data. We will review some papers and then brainstorm.

Things we might be doing:
1) Build a machine learning method to classify players (e.g., selfish versus team-oriented) based on time series game performance data (e.g., number of champion kills, number of shots called, and gold earned)
2) Using the labelled data generated from step (1), test various hypotheses about (a) individual behaviour (e.g., are players who are more willing to take one for the team more successful in the long run?) and (b) team dynamics (e.g., are teams with mostly adaptive players more likely to win?)
3) Explore the dynamics underlying the formation of a competitive ladder (in competitive mode, there are various ranks, i.e.,, bronze, silver, gold, and so forth), which may shed some insights on how some people get more ahead than others in real life

The bigger question is whether we can apply whatever insights we learn to better understand other competitive and/or virtual environments (e.g., sports, hackathons, and virtual reality social platform), to build stronger teams in organizations, and to become better team players ourselves.

If interested, please give me a shout at shing.zhan@gmail.com or join the Slack channel #game-on.

Interested Participants

  • Shing Zhan (shing.zhan@gmail.com)
  • Abdel Abdelgabar (yomegaland AT gmail)
  • Bleu Knight (happylittleaccident AT gmail)

Communication and Cryptography with Chaotic Maps

Keywords

chaotic maps, information theory, information transmission, cryptography, secure communication

Summary

Chaotic maps were discussed today (June 14th) during Liz's talk on dynamic systems. There is a whole zoo of known chaotic maps (there's a pretty good list on wikipedia) and in general if you can find a generating partition for a chaotic map you can use symbolic dynamics to capture the dynamics of the continuous system with a discrete sequence. This allows for easy computation of information theoretic quantities like entropy rate, redundancy, excess entropy, ect. I would be interested in thinking about this sequence as a coding channel for a few different maps. It's also possible to use chaotic maps as a cryptographic scheme (encoding images, for example) and it would be fun to take a look at new approaches and feasibility in these methods, maybe do some information theoretic analysis. Some of this might already have been done, so this group could also be a bit of literature review on the current state of chaotic dynamics.

Sort of analysis I'm talking about: https://pdfs.semanticscholar.org/87e8/aaad9aff5705dd9529a5569c1bfd534d9583.pdf

List of chaotic maps: https://en.wikipedia.org/wiki/List_of_chaotic_maps

Other related papers:

"Introduction to Symbolic Mechanics http://www2.acqs.org/mathstat/personal_pages/williams/wilshort.pdf
"Symbolic Dynamics of Noisy Chaos" http://csc.ucdavis.edu/~cmg/papers/Crutchfield.PhysicaD1983.pdf
"Secure communication using Duffing oscillators" http://ieeexplore.ieee.org/document/6144150/
"A Canonical Partition of the Periodic Orbits of Chaotic Maps" https://www.jstor.org/stable/2000241?seq=1#page_scan_tab_contents


If you're interested please let me know!
amjurgens AT ucdavis.edu
ajurgens on slack

Interested Participants

Alex Jurgens
Jingnuo Dong
Chris Miles
Yao Liu

DNA of Capitalism

More serious title: Are multiplicative dynamics of wealth of the rich part of a countries' population a signature of capitalism?

Summary

One of the first power laws ever documented is the power law behaviour for the upper part of the wealth distribution of countries (Pareto 1897). Since then power laws in wealth distributions seem to be a well documented fact (https://arxiv.org/abs/cond-mat/0302270) and are regarded as a universal feature of economies (Yakovenko and Rosser 2009). The lower part of the wealth distribution is well approximated by a Boltzmann-Gibbs distribution and generated by additive dynamics of the process, which is what happens if wealth is mainly accumulated via labour income. However, the generating mechanism for the upper power law part is a multiplicative dynamics, as the wealth of the "richer" part is predominantly accumulated not via labour but compound interest from investments like equity, fixed incomes, real estate etc.
Now several questions pop up. Is multiplicative dynamics of the wealth of the richer part of the population only a signature of capitalist economies? So far, I am not aware of any data which presents analyses from more or less socialist economies/countries like e.g. former GDR, former Soviet Union, China, Venezuela, Cuba, Vietnam, North Korea etc.

Skills needed

I guess without a good level of the languages of the mentioned countries it's very hard to dig out data from the institutions. Hence, participants with good knowledge of German, Russian, Chinese, Spanish or Korean will be valuable. Additionally, knowledge on power law generating mechanism is valuable, which includes knowledge about stochastic processes. Furthermore, we will have to handle probably big amounts of data and need to be able to properly estimate and distinguish between exponential, logarithmic or power law decays in the tail of the wealth distribution.

If you find this interesting, include your name below.
If you have any questions or ideas just approach me (mark.kirstein[at]tu-dresden.de) as some already did or come to the first casual meeting today on Wednesday June 14th at 16:30 in the main lecture hall. Further contributions and ideas to expand this project are warmly welcome!

Interested Participants

  • Carla Rivera

State Capacity with agent-based modelling / Conflicts and Evolution

Key words: tax, world-systems, migration, agent-based modelling, class.

Summary

State capacity is a key word in the international society. Generally speaking, it involves the ability of one state to achieve certain goals. One crucial part of state capacity is to collect enough tax so that the state could work effectively. Then the questions will be: are the people willing to pay tax? And are the state officials willing to work hard? If we introduce the multi-state system, it could be even more complex. If the people in one state are taxed too heavily, they would migrate to other states. And some strong states can require tributes from some weak states with war threats so that they can buy off their people.

We can work together to model this dynamics!

resources

1. Growing Artificial Societies- Social Science from the Bottom Up ( by Epstein, Axtell)

Interested Participants

  • Junfu Zhao (zjf18810688936@gmail.com)
  • Sheri Xue Guo (xg8@st-andrews.ac.uk)
  • Zhiya Zuo

Organic Boundaries/Cluster within Cities

Summary

Our cities are often divided into many different districts or areas; these areas are usually for urban planning, urban management and administration purposes. These boundaries are fundamentally artificial.

In reality, the city has its own way of organizing its function and activities, e.g. a facility may attract people from not only this area, but from the surrounding areas; a few estates may be very well integrated due to certain reasons.

So in that case, the city would have its own organic, self-organized boundaries/cluster. If we can develop a way to identify these boundaries, they would be very helpful to the urban planning and management, e.g. gain better understanding in the city's spatial patterns, determining the optimal distribution of resources and facilities. The project intends to analyse Singapore as a case study with real dataset.


Interested Participants

  • Zhou Yimin (ZHOU_Yimin@mnd.gov.sg)
  • Xue Guo (xg8@st-andrews.ac.uk)
  • Sean Wu (slwu89@berkeley.edu)
  • Doheum (park154@kaist.ac.kr)
  • Alicia(amullis@umich.edu)
  • Xinya (cinya.he@gmail.com)
  • Tom Logan tomlogan@umich.edu
  • Kyle Reing (reing@usc.edu)

Network's evolution: response and spread

Summary

Through discussion with people, we've seen similarities in the spread of information (e.g, religions, languages, diseases) on networks of people. We would like to explore in more abstract sense if the spread of information follows similar patterns in every kind of complex system. The aim of the project would be to create an abstract model of information spread that could apply to a variety of fields.

Interested Participants

  • Spencer Fox (spncrfx@gmail.com)
  • Rémi Lamarque (remi.lamarque@lpl-aix.fr)
  • Yael Gurevich (yael01[at]gmail.com)
  • Elliot Nelson (enelson@pitp.ca)
  • Huang Tang
  • Rachel Gicquelais (rgic[at]umich.edu)

Agent Based Modeling and Delay Coordinate Embedding

Summary

Often cases in my work, you have very little or no data. To get an idea for the behavior of they system, we often create agent-based models to get a feel for what the real system would do. Without data, we cannot calibrate the parameters of the ABM or validate the results. This leaves us to doing sensitivity analyses and uncertainty quantification in order to give our customers confidence in the results. I think Delay Coordinate Embedding could be a useful tool in performing additional analyses on the results of our ABMs. The main idea is to use the output of an ABM and treat it as a non-linear time series data. Then see if we can use DCE to get information about the dynamics of the "data". My goal is to show proof of concept for an ABM that outputs data where DCE concludes chaotic behavior, but just from analyzing the results of the ABM you might now have seen the chaotic behavior. I am not saying this will always give additional information to the results of an ABM, but I think it is a tool we can add to our toolkit to evaluate the performance of ABMs.


Interested Participants

  • Meghan Galiardi (meghangaliardi (at) gmail.com)
  • Tom Logan tomlogan@umich.edu
  • Shanee Stopnitzky entropy@ucsc.edu
  • Marco Pangallo