Not sure this files under ‘data’, ‘modeling’, or both. In any case, I have a student who’s interested in extraction page structure features from late manuscript, early prints. Think coordinates of images and figures, ratio of area, dimensions, etc. She considers using ImageJ, which would work but would involve a considerable amount of handcrafted work requiring time which… who has time, right?
So I was wondering what the current state on automating such feature extraction is. And if potential tools in existing will work somewhat reliably on this kind of historic material.
Thanks for any leads!