Site scraping
Posted: Wed Mar 16, 2011 4:12 pm
Hello,
For a school project, we're creating a website which makes searching for music scores a lot easier for people who teach in music. It's main advantage is that it's possible to search on instrumentation and duration, so the teacher can find a score which makes it possible to let his 2 students who play the flute practice together with his 2 students who play piano for let's say 45 minutes. There are several groups with the same assignment. One project probably will go live after completion.
Now we're looking for a start for our database. We saw your immense source of scores. We were thinking of indexing all the available scores on imslp, extracting the instrumentation and length, combine this with a link to imslp, and putting it in our database. This way our visitors could find scores based on a given instrumentation and/or length and would be redirected to imslp.
Users would also get the possibility to add a link to a score which isn't available in our database.
When trying to get all the information from your wiki, we got banned because of 'site ripping'. This isn't very surprising, because we are indeed ripping a part of the content of the site. Now my question is if it is permitted to rip only the part 'title', 'composer', 'instrumentation', 'length' and 'link'. Sorry for trying before asking for permission.
Thanks a lot,
Jeroen
student @ University of Ghent, Belgium
For a school project, we're creating a website which makes searching for music scores a lot easier for people who teach in music. It's main advantage is that it's possible to search on instrumentation and duration, so the teacher can find a score which makes it possible to let his 2 students who play the flute practice together with his 2 students who play piano for let's say 45 minutes. There are several groups with the same assignment. One project probably will go live after completion.
Now we're looking for a start for our database. We saw your immense source of scores. We were thinking of indexing all the available scores on imslp, extracting the instrumentation and length, combine this with a link to imslp, and putting it in our database. This way our visitors could find scores based on a given instrumentation and/or length and would be redirected to imslp.
Users would also get the possibility to add a link to a score which isn't available in our database.
When trying to get all the information from your wiki, we got banned because of 'site ripping'. This isn't very surprising, because we are indeed ripping a part of the content of the site. Now my question is if it is permitted to rip only the part 'title', 'composer', 'instrumentation', 'length' and 'link'. Sorry for trying before asking for permission.
Thanks a lot,
Jeroen
student @ University of Ghent, Belgium