My requirement is to extract text and audio from an FLV file. Please suggest how I can achieve this using a web crawler. If it is not possible with a web crawler, please suggest any other tool.

Thank you.

Solution

Using Nutch, you can parse and extract metadata from the FLV file. If the text was added to the file as part of its metadata, you can retrieve it with Nutch and store it in a database.
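
Before crawling, it can help to confirm that a file really is an FLV and that it carries an audio stream at all. The following is only a rough sketch in Python, not part of Nutch: it reads the fixed 9-byte FLV file header (signature, version, flags, header size). The function name `parse_flv_header` is made up for illustration, and the text metadata itself (the `onMetaData` script tag) is not decoded here.

```python
import struct


def parse_flv_header(data: bytes) -> dict:
    """Parse the 9-byte FLV file header (hypothetical helper, not a Nutch API)."""
    if len(data) < 9 or data[:3] != b"FLV":
        raise ValueError("not an FLV file")
    # Byte 3: version, byte 4: type flags, bytes 5-8: header size (big-endian).
    version, flags, header_size = struct.unpack(">BBI", data[3:9])
    return {
        "version": version,
        "has_audio": bool(flags & 0x04),  # bit 2 marks an audio stream
        "has_video": bool(flags & 0x01),  # bit 0 marks a video stream
        "header_size": header_size,
    }
```

To use it, pass the first 9 bytes of the file, for example `parse_flv_header(open("video.flv", "rb").read(9))`, where `video.flv` is a placeholder path.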

But you should probably be looking at a combination of wget (to download the content) and an FLV stream extraction tool to achieve what you require.
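
A minimal sketch of that two-step approach in Python, assuming `wget` and `ffmpeg` are installed and on the PATH. Note that ffmpeg is my own pick for the extraction tool; no specific tool is named above. The function names, the file names and the WAV output format are placeholders chosen for illustration.

```python
import subprocess


def build_download_cmd(url: str, dest: str) -> list:
    """wget command that saves `url` to the local file `dest`."""
    return ["wget", "-O", dest, url]


def build_audio_extract_cmd(flv_path: str, audio_path: str) -> list:
    """ffmpeg command that drops the video stream (-vn) and writes the audio.

    -y overwrites an existing output file; the audio format is inferred
    from the extension of `audio_path`.
    """
    return ["ffmpeg", "-y", "-i", flv_path, "-vn", audio_path]


def extract_audio(url: str, flv_path: str = "video.flv",
                  audio_path: str = "audio.wav") -> str:
    """Download the FLV, then extract its audio track. Raises on failure."""
    subprocess.run(build_download_cmd(url, flv_path), check=True)
    subprocess.run(build_audio_extract_cmd(flv_path, audio_path), check=True)
    return audio_path
```

Calling `extract_audio("http://example.com/video.flv")` (a placeholder URL) would leave the audio in `audio.wav`. This covers the audio only; any text would still have to come from the file's metadata.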

Nutch

Wget

FLV metadata

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow