Towards Easier Cross Matching with Astrogrid: AstroFederator
Classes of Cross match
There are several cases to consider, depending on the sizes of the tables being cross-matched.
Very Small
one or a few objects against many tables
Small to Small
a few thousand objects against a few thousand objects
Small to Medium
a few thousand objects against tables of up to 1M objects
- can use STILTS, but note that in a CEA-like context moving the larger catalogue to wherever STILTS is running may be expensive
Small to Large
a few hundred objects against tables of over 1M objects
- If the larger table has a cone-search-like interface, the crossmatch can be faked by doing one cone search for each row of the smaller table. STILTS multicone automates this on the client side for actual Cone Search services, and I think Eduardo has done something similar using Python scripts. Mark and Kona are thinking about, or working on, server-side functionality that does the same kind of thing, which should be more convenient and faster for users.
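The one-cone-per-row idea can be sketched in a few lines of Python. This is only an illustration, not the STILTS implementation: `make_local_cone_search` is a hypothetical in-memory stand-in for a real Cone Search service, the `ra`/`dec` column names and the 2 arcsec radius are assumptions, and a real client would issue one HTTP request per row instead.

```python
import math

def angular_sep_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(h))))

def make_local_cone_search(catalogue):
    """Hypothetical stand-in for a remote cone-search service."""
    def cone_search(ra, dec, radius_deg):
        return [row for row in catalogue
                if angular_sep_deg(ra, dec, row["ra"], row["dec"]) <= radius_deg]
    return cone_search

def multicone_match(small_table, cone_search, radius_deg):
    """Run one cone search per row of the small table and collect the pairs."""
    matches = []
    for row in small_table:
        for hit in cone_search(row["ra"], row["dec"], radius_deg):
            matches.append((row, hit))
    return matches

# Made-up positions, in degrees.
small = [{"id": "s1", "ra": 10.0, "dec": 20.0},
         {"id": "s2", "ra": 200.0, "dec": -45.0}]
large = [{"id": "L1", "ra": 10.0002, "dec": 20.0001},
         {"id": "L2", "ra": 10.5, "dec": 20.0},
         {"id": "L3", "ra": 199.9999, "dec": -45.0001}]

pairs = multicone_match(small, make_local_cone_search(large), radius_deg=2.0 / 3600)
matched_ids = [(a["id"], b["id"]) for a, b in pairs]
```

The cost grows with the number of rows in the smaller table, since each row triggers its own query; this is why the approach suits the small-to-large case and why a server-side version should be faster.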
Medium to Medium
up to 1M objects against up to 1M objects
- can use STILTS, but note that in a CEA-like context moving the large catalogues to wherever STILTS is running may be expensive
Quite Large to Quite Large
several million objects in either or both tables
- The algorithms in STILTS could be souped up to cope with this on request, I think -- MarkTaylor
Really Large to Medium
hundreds of millions against hundreds of thousands
- This could perhaps be done by ingesting the medium table into the database holding the large one and doing the match inside the database.
- It could also be done outside a database using custom crossmatching code, if the large table were available in non-database form.
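As a rough sketch of the in-database option, the snippet below uses Python's built-in `sqlite3` module as a stand-in for the data centre's database. The table and column names (`large_cat`, `medium_cat`, `ra_deg`, `dec_deg`), the 2 arcsec radius and the sample rows are all assumptions made for illustration; a production system would more likely rely on a spatial index than on a plain declination index.

```python
import math
import sqlite3

def sep_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(h))))

conn = sqlite3.connect(":memory:")
conn.create_function("sep_deg", 4, sep_deg)  # make the distance usable from SQL

# The "large" catalogue already lives in the database, indexed on declination.
conn.execute("CREATE TABLE large_cat (id TEXT, ra_deg REAL, dec_deg REAL)")
conn.execute("CREATE INDEX large_dec ON large_cat (dec_deg)")
conn.executemany("INSERT INTO large_cat VALUES (?, ?, ?)", [
    ("L1", 10.0002, 20.0001),
    ("L2", 10.0, 20.01),
    ("L3", 50.0001, 5.0),
])

# Ingest the medium table alongside it.
conn.execute("CREATE TABLE medium_cat (id TEXT, ra_deg REAL, dec_deg REAL)")
conn.executemany("INSERT INTO medium_cat VALUES (?, ?, ?)", [
    ("m1", 10.0, 20.0),
    ("m2", 50.0, 5.0),
])

# A declination band is a cheap, index-friendly prefilter (the true separation
# is never smaller than the declination difference); the exact test follows.
rows = conn.execute("""
    SELECT m.id, l.id
    FROM medium_cat AS m
    JOIN large_cat AS l
      ON l.dec_deg BETWEEN m.dec_deg - :r AND m.dec_deg + :r
    WHERE sep_deg(m.ra_deg, m.dec_deg, l.ra_deg, l.dec_deg) <= :r
    ORDER BY m.id, l.id
""", {"r": 2.0 / 3600}).fetchall()
```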
Really Large to Really Large
hundreds of millions against hundreds of millions
- Since it is not feasible to move either table from one place to another, this can only be done by a data centre holding both catalogues. It would presumably be done once for each pair that may be of interest, rather than on user demand.
- ROE have some pre-calculated neighbour catalogues generated along these lines: see WSA crossmatching.
- CDS have for several years been threatening to prepare pre-calculated indices, and hence a service that would allow spatial crossmatches between any of their catalogues, even the large ones. So far, however, this has failed to appear.
- The data centres will have their own run-once way of doing this match. One good algorithm for sky matches is the sort-sweep algorithm described by Devereux, Abel, Power and Lamb.
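For readers unfamiliar with the idea, here is a minimal sort-and-sweep sketch in Python. It is a generic illustration of the technique and does not claim to reproduce the algorithm as published by Devereux, Abel, Power and Lamb: both tables are sorted on declination, and a sliding window over the second table is compared against each row of the first. The `ra`/`dec` column names, the radius and the sample rows are assumptions.

```python
import math

def sep_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(h))))

def sweep_match(table_a, table_b, radius_deg):
    """Sort both tables on declination, then sweep a window across table_b."""
    a = sorted(table_a, key=lambda r: r["dec"])
    b = sorted(table_b, key=lambda r: r["dec"])
    matches = []
    start = 0  # first row of b that can still match the current or later rows of a
    for row_a in a:
        # Rows of b below the declination band can never match again: skip them for good.
        while start < len(b) and b[start]["dec"] < row_a["dec"] - radius_deg:
            start += 1
        # Test only the rows of b inside the declination band around row_a.
        j = start
        while j < len(b) and b[j]["dec"] <= row_a["dec"] + radius_deg:
            if sep_deg(row_a["ra"], row_a["dec"], b[j]["ra"], b[j]["dec"]) <= radius_deg:
                matches.append((row_a["id"], b[j]["id"]))
            j += 1
    return matches

# Made-up positions, in degrees.
table_a = [{"id": "a1", "ra": 10.0, "dec": 20.0},
           {"id": "a2", "ra": 200.0, "dec": -45.0},
           {"id": "a3", "ra": 30.0, "dec": 60.0}]
table_b = [{"id": "b1", "ra": 10.0002, "dec": 20.0001},
           {"id": "b2", "ra": 199.9999, "dec": -45.0001},
           {"id": "b3", "ra": 120.0, "dec": 20.0}]

matches = sweep_match(table_a, table_b, radius_deg=2.0 / 3600)
```

After the two sorts, each table is traversed essentially once, so both can be streamed from disk in declination order rather than held in memory, which is what makes this family of algorithms attractive for very large catalogues.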
Other Approaches
Openskyquery
Starglobe
Multi-Catalogue Multi-Cone Search
--
RichardMcMahon - 14 Jul 2007 added MCMCS link
--
NicholasWalton - 13 Jul 2007
--
JonathanTedds - 12 Jul 2007
Topic revision: r9 - 2007-10-02 - 16:36:30 - MarkTaylor