Actions

CSSS 2010 Santa Fe-Student Notes: Difference between revisions

From Santa Fe Institute Events Wiki

(added in my hot programming links and some headings so they are findable)
(→‎networks: added in gephi)
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
{{CSSS 2010 Santa Fe}}
{{CSSS 2010 Santa Fe}}


(Supplementary material, suggested readings, tutorials, and papers contributed by the participants of CSSS 2010.)


=staying in touch=
For those linked in to LinkedIn, there seems to be a group that unites all [http://www.linkedin.com/groups?about=&gid=115966&report%2Esuccess=r3Tayp0nRRro3Er8iWS8vO-u_mFd11ndGIOEdAI27ES3KgpplepkOcIgotS3mJWzXqb2u21wqjDJwM CSSS alumni]  [[User:Ligtvoet|Andreas]]
For those linked in to LinkedIn, there seems to be a group that unites all [http://www.linkedin.com/groups?about=&gid=115966&report%2Esuccess=r3Tayp0nRRro3Er8iWS8vO-u_mFd11ndGIOEdAI27ES3KgpplepkOcIgotS3mJWzXqb2u21wqjDJwM CSSS alumni]  [[User:Ligtvoet|Andreas]]


=Supplementary material, suggested readings, tutorials, and papers contributed by the participants of CSSS 2010. =


== computer programming related ==
= computer programming related =


==general==
* [http://www.greenteapress.com/compmod/ Computational Modeling and Complexity Science] - An excellent, free (though not-quite-finished) PDF guide to the concepts and programming techniques needed to develop models of complex systems, from graphs and small-world networks to cellular automata and agent-based models. Uses [http://www.python.org/ Python], which I'd highly recommend for beginner and experienced programmers alike: it's great to read, easy to maintain, and makes coding fun again.
* [http://www.greenteapress.com/compmod/ Computational Modeling and Complexity Science] - An excellent, free (though not-quite-finished) PDF guide to the concepts and programming techniques needed to develop models of complex systems, from graphs and small-world networks to cellular automata and agent-based models. Uses [http://www.python.org/ Python], which I'd highly recommend for beginner and experienced programmers alike: it's great to read, easy to maintain, and makes coding fun again.
Presupposes some basic working Python knowledge, which is provided by a second of the author's free ebooks, [http://greenteapress.com/thinkpython/thinkpython.html Think Python].
Presupposes some basic working Python knowledge, which is provided by a second of the author's free ebooks, [http://greenteapress.com/thinkpython/thinkpython.html Think Python].
Line 17: Line 19:
::''Computers are as important to modern science as telescopes and test tubes. Unfortunately, most scientists are never taught how to use them effectively. After a generic first-year programming course, most scientists have to figure out for themselves how to build, validate, maintain, and share complex programs. This is about as fair as teaching someone arithmetic and then expecting them to figure out calculus on their own, and about as likely to succeed.''
::''Computers are as important to modern science as telescopes and test tubes. Unfortunately, most scientists are never taught how to use them effectively. After a generic first-year programming course, most scientists have to figure out for themselves how to build, validate, maintain, and share complex programs. This is about as fair as teaching someone arithmetic and then expecting them to figure out calculus on their own, and about as likely to succeed.''


His course is also based around python, but aimed at general purpose scientific computing and covers a lot fo the fiddly bits that are commonly missed - not jsut numerical stability, but testing, maintaining versions, sharing your results without pain etc.
His course is also based around python, but aimed at general purpose scientific computing and covers a lot fo the fiddly bits that are commonly missed - not jsut numerical stability, but testing, maintaining versions, sharing your results without pain etc. -- [[User:Dan90|Dan MacKinlay]]
 


==networks==
* Google's Graph framework is out and has some interesting notes about the problems of large network data:
* Google's Graph framework is out and has some interesting notes about the problems of large network data:
:: Many practical computing problems concern large graphs. Standard examples include the Web graph and various so- cial networks. The scale of these graphs—in some cases bil- lions of vertices, trillions of edges—poses challenges to their efficient processing. In this paper we present a computa- tional model suitable for this task. Programs are expressed as a sequence of iterations, in each of which a vertex can receive messages sent in the previous iteration, send mes- sages to other vertices, and modify its own state and that of its outgoing edges or mutate graph topology. This vertex- centric approach is flexible enough to express a broad set of algorithms. The model has been designed for efficient, scal- able and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier. Distribution- related details are hidden behind an abstract API. The result is a framework for processing large graphs that is expressive and easy to program
:: Many practical computing problems concern large graphs. Standard examples include the Web graph and various so- cial networks. The scale of these graphs—in some cases bil- lions of vertices, trillions of edges—poses challenges to their efficient processing. In this paper we present a computa- tional model suitable for this task. Programs are expressed as a sequence of iterations, in each of which a vertex can receive messages sent in the previous iteration, send mes- sages to other vertices, and modify its own state and that of its outgoing edges or mutate graph topology. This vertex- centric approach is flexible enough to express a broad set of algorithms. The model has been designed for efficient, scal- able and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier. Distribution- related details are hidden behind an abstract API. The result is a framework for processing large graphs that is expressive and easy to program
http://portal.acm.org/citation.cfm?id=1807167.1807184
http://portal.acm.org/citation.cfm?id=1807167.1807184  
 
* [http://gephi.org/ Gephi] is what everyone seems to be using for visualisations -- [[User:Dan90|Dan MacKinlay]]
-- [[User:Dan90|Dan MacKinlay]]


* <b> I've had some requests for R Tutorials, so here are a few things that I made for a Biostatistics class I TA'd: </b>
== stats==
I've had some requests for R Tutorials, so here are a few things that I made for a Biostatistics class I TA'd: </b>
* [http://charles.stanford.edu/~bio141/tutorials/RTutorial.pdf R Tutorial Powerpoint]
* [http://charles.stanford.edu/~bio141/tutorials/RTutorial.pdf R Tutorial Powerpoint]
* [http://charles.stanford.edu/~bio141/tutorials/RTutorial.R R Example Codes]
* [http://charles.stanford.edu/~bio141/tutorials/RTutorial.R R Example Codes]

Latest revision as of 21:35, 22 June 2010

CSSS Santa Fe 2010

(Supplementary material, suggested readings, tutorials, and papers contributed by the participants of CSSS 2010.)

staying in touch

For those linked in to LinkedIn, there seems to be a group that unites all CSSS alumni Andreas


computer programming related

general

  • Computational Modeling and Complexity Science - An excellent, free (though not-quite-finished) PDF guide to the concepts and programming techniques needed to develop models of complex systems, from graphs and small-world networks to cellular automata and agent-based models. Uses Python, which I'd highly recommend for beginner and experienced programmers alike: it's great to read, easy to maintain, and makes coding fun again.

Presupposes some basic working Python knowledge, which is provided by a second of the author's free ebooks, Think Python.

-- Daniel Jones

  • On that note, here's a note from my blog entry: My most recommended link for this trip - Greg Wilson's awesome Software carpentry course from UToronto ´is available freely online. As the man himself says:
Computers are as important to modern science as telescopes and test tubes. Unfortunately, most scientists are never taught how to use them effectively. After a generic first-year programming course, most scientists have to figure out for themselves how to build, validate, maintain, and share complex programs. This is about as fair as teaching someone arithmetic and then expecting them to figure out calculus on their own, and about as likely to succeed.

His course is also based around python, but aimed at general purpose scientific computing and covers a lot fo the fiddly bits that are commonly missed - not jsut numerical stability, but testing, maintaining versions, sharing your results without pain etc. -- Dan MacKinlay

networks

  • Google's Graph framework is out and has some interesting notes about the problems of large network data:
Many practical computing problems concern large graphs. Standard examples include the Web graph and various so- cial networks. The scale of these graphs—in some cases bil- lions of vertices, trillions of edges—poses challenges to their efficient processing. In this paper we present a computa- tional model suitable for this task. Programs are expressed as a sequence of iterations, in each of which a vertex can receive messages sent in the previous iteration, send mes- sages to other vertices, and modify its own state and that of its outgoing edges or mutate graph topology. This vertex- centric approach is flexible enough to express a broad set of algorithms. The model has been designed for efficient, scal- able and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier. Distribution- related details are hidden behind an abstract API. The result is a framework for processing large graphs that is expressive and easy to program

http://portal.acm.org/citation.cfm?id=1807167.1807184

stats

I've had some requests for R Tutorials, so here are a few things that I made for a Biostatistics class I TA'd:

-- Julie Granka