I’m planning to run regular workshops over the next few months on how to get the most out of Global25 data with various programs, and especially PAST (see here). So if you have Global25 coordinates, please stay tuned.
To that end, I’ve put together four color-coded, PAST-compatible Global25 datasheets with thousands of present-day and ancient samples, available at the links below:
PAST is an awesome little statistical program and simple to use. The manual is available here. To kick things off, here’s a quick guide on how to run a Neighbor Joining tree on your Global25 coordinates:
– download the Global_25_PCA_pop_averages_scaled.dat from the last link above
– open the dat file with something a little more advanced than Windows Notepad, like, say, TextPad (see here)
– stick your scaled coordinates at the bottom of the sheet, so that they look exactly like those of the other samples, except give yourself an original symbol, like, say, a black star
– open the edited dat file with PAST and choose all of the columns and rows by clicking the empty tab above the labels
– then, at the top, go to Multivariate > Clustering > Neighbor joining
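For anyone curious about what the Neighbor joining step is doing with the coordinates, here's a minimal Python sketch of the textbook algorithm. It is not PAST's own code, and it assumes plain Euclidean distances between samples. The labels (PopA to PopD, You) and the three-dimensional coordinates are made up for illustration; real Global25 rows have 25 values each. The function `neighbor_joining` is a hypothetical helper that only records which nodes get joined, not branch lengths.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Straight-line distance between two coordinate vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def neighbor_joining(coords):
    """Return the joins as a list of (node_a, node_b, new_node) tuples.

    coords maps a sample label to its list of coordinates.
    Only the tree topology is recorded; branch lengths are skipped.
    """
    nodes = list(coords)
    dist = {
        frozenset((a, b)): euclidean(coords[a], coords[b])
        for a, b in combinations(nodes, 2)
    }

    def d(a, b):
        return dist[frozenset((a, b))]

    joins = []
    counter = 0
    while len(nodes) > 2:
        n = len(nodes)
        # Total distance from each node to all of the others.
        r = {a: sum(d(a, b) for b in nodes if b != a) for a in nodes}
        # Standard NJ criterion: join the pair with the smallest Q value.
        a, b = min(
            combinations(nodes, 2),
            key=lambda p: (n - 2) * d(*p) - r[p[0]] - r[p[1]],
        )
        new = f"node{counter}"
        counter += 1
        # Distance from the new internal node to every remaining node.
        for k in nodes:
            if k not in (a, b):
                dist[frozenset((new, k))] = 0.5 * (d(a, k) + d(b, k) - d(a, b))
        joins.append((a, b, new))
        nodes = [k for k in nodes if k not in (a, b)] + [new]
    # The last two nodes are simply connected to each other.
    joins.append((nodes[0], nodes[1], "root"))
    return joins


# Toy data: made-up labels and 3D coordinates, not real Global25 values.
toy = {
    "PopA": [-1.0, -1.0, -0.9],
    "PopB": [1.0, 0.0, 0.0],
    "PopC": [0.0, 1.0, 0.0],
    "PopD": [0.0, 0.0, 1.0],
    "You": [-1.0, -1.0, -1.0],
}
for a, b, new in neighbor_joining(toy):
    print(f"{a} + {b} -> {new}")
```

In this toy set the sample labeled You sits right next to PopA, so those two are the first pair to be joined. The later joins among the remaining nodes can be resolved in more than one equivalent way, because the tree is unrooted.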
After a few seconds you should see a nice, color-coded tree like the one below, except you’ll also be on it, in black text. I’m very happy with these results, by the way. As far as I can see, all of the populations and individuals cluster exactly where they should.
Those of you who are already very proficient in using PAST, feel free to go nuts with these new datasheets and show us the results in the comments below. I’ll try to put together a workshop for beginners within the next couple of weeks.
Modeling genetic ancestry with Davidski: step by step