Actions

Lexical networks: Difference between revisions

From Santa Fe Institute Events Wiki

No edit summary
No edit summary
Line 30: Line 30:


Now, of course (1) and (2) above could actually end up being pursuable in terms of / within the same project: for example, the identification of synonyms within a lexical network could lead to attack tolerance (e.g. how to design a "self-healing" lexical network so that, if some of the nodes in the network are taken out, synonyms can step in to take over for the words thus taken out...)
Now, of course (1) and (2) above could actually end up being pursuable in terms of / within the same project: for example, the identification of synonyms within a lexical network could lead to attack tolerance (e.g. how to design a "self-healing" lexical network so that, if some of the nodes in the network are taken out, synonyms can step in to take over for the words thus taken out...)
==An interesting exchange with John Mahoney:==
On 6/8/07, John Mahoney <jrmahoney@ucdavis.edu> wrote:
>
> might be interesting to try to model a lex net where synonyms display resistance to becoming
> equally used. attempting to use ideas from Page's talk on consistency and coherence. some say
> ketchup.. some say catsup ?
>
> so basically thinking about the difference between semantic equivalence and equivalent use.
>
> ?-john
Hi John,
Thanks for this interesting idea.
It makes me think of each node (word) maybe as a "basin of
attraction", drawing meanings into it, with some (a few) meanings that
are perched precariously on the rim between two basins, and capable of
going down the direction of either word.
We might also have a natural tendency to "compartmentalize" the world
into discrete, non-overlapping categories  (this would make sense from
the viewpoint of evolutionary history, I think -- it might simply make
sense to carve up the world into discrete categories if you're a
hunter-gatherer on the savannahs trying to make split-second
decisions). And so maybe we can say, from one point of view, that
"language abhors synonyms" ?
And there is the competing pressure, for reasons of building
fault-tolerance into the system, etc, to get some redundancy in there,
too.
So from the point of view of a lexical network, there are good
reasons, perhaps, for nodes to be similar  (what you called "semantic
equivalence") as well as for nodes to be dissimilar (what you called
"non-equivalence of use").
Thanks again, am putting this up on the Wiki.

Revision as of 05:51, 11 June 2007

CSSS Santa Fe 2007


(suggested by Sayan)

Members: (please add yourself if you are interested )

1.Sayan_Bhattacharyya

2.

3.


Lexical networks are graphs encoding the co-occurrence of words in large texts. (If the text is sufficiently large, we can pretend that the network encodes the entire language).

In the graph, two connected words are adjacent, and the degree of a given word is the number of edges that connect it with other words. You can take a look at this paper to see what lexical networks look like: (it's a short, readable paper): The small-world of human language' by Cancho i Ferrer and Solé.


Thinking of going along either of the two possible directions for the project (but open to other suggestions):

(1) Perhaps, something along the lines of how to identify synonyms from within a lexical network / exploring suitable "metrics" for synonyms

(2) Perhaps, something along the lines of exploring attack tolerance in lexical networks (e.g. tolerance to knocking out some nodes in the network). [An interesting paper to look at here may be: "Albert, R., Jeong, H., & Barabasi, A.-L. (2000). Error and attack tolerance of complex networks. Nature, 406, 378-381.] Evidently, this can have some interesting connections to the Healing strategies for networks project, as well.

A motivation for thinking about (1) is that questions/mechanisms of analogy-making and comparison-making at all levels of cognition tend to be very interesting questions, and so (1) fits in well with some broader questions in that regard.

A motivation for thinking about (2) is that several people in the summer school are thinking of working on projects on fault tolerance / attack tolerance in small-world networks -- e.g. in biological /metabolic networks, in neural networks, etc, and so (2) would "mesh" well with similar projects with other kinds of scale-free networks that others in the summer school are thinking of working on, leading to exchange of ideas, etc.

Now, of course (1) and (2) above could actually end up being pursuable in terms of / within the same project: for example, the identification of synonyms within a lexical network could lead to attack tolerance (e.g. how to design a "self-healing" lexical network so that, if some of the nodes in the network are taken out, synonyms can step in to take over for the words thus taken out...)

An interesting exchange with John Mahoney:

On 6/8/07, John Mahoney <jrmahoney@ucdavis.edu> wrote: > > might be interesting to try to model a lex net where synonyms display resistance to becoming > equally used. attempting to use ideas from Page's talk on consistency and coherence. some say > ketchup.. some say catsup ? > > so basically thinking about the difference between semantic equivalence and equivalent use. > > ?-john

Hi John,

Thanks for this interesting idea.

It makes me think of each node (word) maybe as a "basin of attraction", drawing meanings into it, with some (a few) meanings that are perched precariously on the rim between two basins, and capable of going down the direction of either word.

We might also have a natural tendency to "compartmentalize" the world into discrete, non-overlapping categories (this would make sense from the viewpoint of evolutionary history, I think -- it might simply make sense to carve up the world into discrete categories if you're a hunter-gatherer on the savannahs trying to make split-second decisions). And so maybe we can say, from one point of view, that "language abhors synonyms" ?

And there is the competing pressure, for reasons of building fault-tolerance into the system, etc, to get some redundancy in there, too.

So from the point of view of a lexical network, there are good reasons, perhaps, for nodes to be similar (what you called "semantic equivalence") as well as for nodes to be dissimilar (what you called "non-equivalence of use").

Thanks again, am putting this up on the Wiki.