An approach to extracting sub-schema similarities from semantically heterogeneous XML Schemas