Repository for Google Summer of Code Project - 2018
Parts of this module:-
- Master XML document
- Convertor
- Output XML docs in graphically compatible format
##Some sample simulation pictures from Gephi
Pharyngeal Cell Cluster
-> XML to compatible graphical format convertor
- Takes input in XML format and generates an XML document in a graphical schema that is compatible with the simulation application being used - in this case, GEPHI.
- As can be seen in flow chart above, the convertor in the middle has been written in python.
__
Lets say a user specifically wants to analyse only a certain group of cells. In that case, there is no point in generating a *.gml or *.gexf file that has all the cells because it will over-densify the graph simulated in Gephi.
For example, if user only wants to analyse the Amphid Neurons, then the following cells only will be filtered out and converted to a *.gml file :
- ADFL - AB alpppppaa (Amphid neuron, prob. chemosensory)
- ADFR = AB praaappaa (Amphid neuron, prob. chemosensory)
- ADLL - AB alppppaad (Amphid neuron, prob. chemosensory)
- ADLR - AB praaapaad (Amphid neuron, prob. chemosensory)
Here, we should make note that some cells have a totally independant lineage starting from the precursor (P0,AB,P1, etc.) right up the way to the final adult cell node.
However, some final adult cells have a n'th degree common parent ancestor cell. A hypothetical example has been shown in the below picture
So, for these cells care must be taken to not repeat the node names in the *.gml or *.gexf file. That is why, we need a script that will specifically have the following step by step task:
- Understand which group of cells user wants to analyse.
- Query the *.csv file to capture only those cells that belong to that chosen category.
- Print/write the lineage names of all the cells identified in step 2 in a text file
- Have another script that will capture these lineage names from text file generated in step 3 and nodify them. Nodify is the process of splitting the lineage into succesive generations of nodes.
- Filter out the redundant nodes that need not be simulated more than once.




