CSSS 2010 Santa Fe-Projects & Working Groups
From Santa Fe Institute Events Wiki
CSSS Santa Fe 2010 |
Students are required to craft a research project -- use this page to brainstorm and organize your efforts.
Evolution of Words (Dan Rockmore) - In a class on complex systems that I teach at Dartmouth one of the final projects seemed to indicate from a small and somewhat biased sample of English words, that word origins (as indicated by one of the online dictionaries) seem clustered at certain times. As a start I would propose a mining of this info in some online dictionary, performing some initial analysis and see if "there is a there, there.." and if so, keep on going.
Dynamics of Equities Market Structure (Dan Rockmore) - In a paper of mine w/some of my buddies (some of whom you will meet this summer), "Topological Structures in the Equities Market," PNAS December 30, 2008 vol. 105 no. 52 20589-20594, we found some interesting structure in the correlation network of the NYSE equities market. This required a choice of a time window. It would be interesting to see how/if this structure changes over time and window size, especially on either side of market crises. Scott Pauls has code that could be used to do some of this analysis.
Movement Careers of Couchsurfing.org members (Bogdan State) - I am working with Couchsurfing.org and two Stanford Professors in trying to analyze this social movement organization's member data. One aspect both we and the Couchsurfing management are interested in is the evolution of members in the movement over time. I would like to perform a preliminary analysis of these "movement careers", using a sample of about 10,000 nodes (out of 1.7 milion) we are scheduled to obtain soon.
"Genes for Breakfast" (Yixian Song) - I've once read a paper of Redfield(1993) "Genes for Breakfast: The Have-Your-Cake and-Eat-lt-Too of Bacterial Transformation". Though it's an old publication, I still find the idea very inspiring. Well, considering bacteria living in a gene-pool with abandoned DNA strands, each bacterium can randomly "eat" free DNA strands, and use them as nutrition or for DNA repairing or even gene improvement. But the DNA strands were abandoned for a reason. Some of them can be virulent.(!!!) Besides bacteria can exchange DNA with each other, of course. We can define a population size of bacteria, amount of free DNA strands in gene-pool, percentage of virulent DNA and their virulence (impact on the bacteria fitness). We certainly can also consider the bacteria as a metapopulation.("A metapopulation consists of a group of spatially separated populations of the same species which interact at some level." - says wikipedia.org) The question to be answered will be "in which situation the bacterial population will become extinct in the end".
Patterns in Cenozoic Western US volcanism (Leif Karlstrom) - Allen Glazner (UNC) has put together a neat database of volcanic activity over the past 65 million years in the Western US (here's a movie of it), including location, duration of activity and lava composition. This data is derived from several careers worth of geologic mapping and dating volcanic rocks exposed all over the West. While it is not complete (not everything is preserved, and not everything has been mapped yet), there is a wealth of information about volcanic processes in here. I think it would be neat to mine this dataset for correlations, then think about ways to model it. This could include actual physics and geology, but could also be based solely on the data.
Pitch diffusion in groups of musicians (Leif Karlstrom) - When the violin section of an orchestra tunes, the concertmaster gets up and plays a note that all the rest of the violins try to match. I did some experiments in my undergrad with John Toner (physics, U Oregon) where we looked at what happens when the frequency of this tuning note shifts during the time when players are actively trying to match one another. We found that the shifted pitch diffuses through group if it is a small shift (a few Hz), but is immediately sensed by the whole group if it is a large shift. This implies that there is a shift from local to long-range interaction that governs how pitch matching occurs. We envisioned a process similar to flocking behavior in birds for the local interactions, which is governed by an advection-diffusion equation. But we were unable to model the data with this model, because it does not allow for long-range interactions. I still have the data, and would be interested in thinking again about how people process sound in groups.
- Sounds like a cool topic! A quick question: do you have data on the social structure of the orchestra? It would be interesting to look at the formal hierarchy, as well as at the informal social network, and see if it has any influence on pitch diffusion, especially for the long-range interactions.
Language Evolution in an Archipelago (Erika Fille Legara) - The Philippines is an archipelago containing 7,106 islands with three broader divisions (three main islands): Luzon, Visayas, and Mindanao. It has around 175 individual languages, four of which already have no more known speakers. Moreover, the Constitution recognizes eight (8) major and twelve (12) regional languages (statistics are taken from Wikipedia on the Philippines). It is also interesting to note that most Filipinos know at least three languages: (1) his/her native language, (2) Filipino, and (3) English. Now, if I could get data on the different language distributions (per year or per decade) within the archipelago, it might give us new insights on how certain languages evolve. It would also be interesting to model or predict which languages would eventually thrive and die. Also, I'd like to predict what would happen to certain languages at certain regional boundaries after a few decades or a few centuries. And finally, taking a hint from Professor Dan's idea (above), it may also be interesting to look at how certain words in the Filipino dictionary evolve through time. Caveat: I still need to check if we could have the data available before June.