Jan Kot
Crawler for Liferay's Document Library - newbie question.
November 17, 2007 10:06 AM

Hi,
I've got a Liferay (4.3.4) portal that stores some files in its Document Library. My goal is to create a crawler that periodically visits all documents stored in the Document Library and builds a detailed document list. The new list will be compared with the previous one to detect which documents are new, changed, or deleted. The new and changed files will then be retrieved from the Document Library and processed. The preferred interface for the crawler is a web service interface.
I am a newbie and am looking for Java examples, documentation, or hints on where to start. Any help will be appreciated.
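
To make the comparison step concrete, here is a minimal sketch of how two document lists could be diffed. It does not touch Liferay at all: it assumes each list has already been fetched into a map from a document identifier (for example a folder path plus file name) to some change marker (for example a version string or a modified date). The class name DocumentListDiff, the Result type, and the choice of marker are all hypothetical, just for illustration.

```java
import java.util.Map;
import java.util.Objects;
import java.util.Set;
import java.util.TreeSet;

/**
 * Compares two snapshots of a document list.
 * Keys are document identifiers; values are a change marker such as a
 * version string or a last-modified timestamp (both are assumptions,
 * not something Liferay prescribes).
 */
class DocumentListDiff {

    /** Identifiers grouped by what happened between the two snapshots. */
    public static final class Result {
        public final Set<String> added = new TreeSet<>();
        public final Set<String> changed = new TreeSet<>();
        public final Set<String> deleted = new TreeSet<>();
    }

    public static Result diff(Map<String, String> previous, Map<String, String> current) {
        Result result = new Result();

        // Anything in the current list is either new, changed, or unchanged.
        for (Map.Entry<String, String> entry : current.entrySet()) {
            String id = entry.getKey();
            if (!previous.containsKey(id)) {
                result.added.add(id);
            } else if (!Objects.equals(previous.get(id), entry.getValue())) {
                result.changed.add(id);
            }
        }

        // Anything only in the previous list has been deleted.
        for (String id : previous.keySet()) {
            if (!current.containsKey(id)) {
                result.deleted.add(id);
            }
        }
        return result;
    }
}
```

The crawler would call DocumentListDiff.diff(previousList, currentList) after each run, fetch and process the documents in added and changed, drop the ones in deleted from its index, and then store currentList as the previous list for the next run.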

Regards
JanK
Roman Hoyenko
RE: Crawler for Liferay's Document Library - newbie question.
November 17, 2007 11:53 AM

I am not sure why you would need that.

Are you going to have several sites with the same set of documents that you would try to compare?

You can always query the database to get the list of documents and find the latest ones. Liferay uses Jackrabbit to store documents (http://jackrabbit.apache.org/), so you can look for code examples there. You can also look at the Document Library portlet to see how it works.
Jan Kot
RE: Crawler for Liferay's Document Library - newbie question.
November 17, 2007 3:15 PM

I am building a Lucene-based search engine, independent of Liferay, that searches data retrieved from different data sources such as the filesystem, a database, the Liferay Document Library, and others. I know that Liferay has its own search engine, but I can't use it; instead I must retrieve the documents stored in its Document Library and index them on my own.

Regards
JanK
Hitoshi Ozawa
RE: Crawler for Liferay's Document Library - newbie question.
December 9, 2010 3:00 PM

I'm pretty sure you don't need this information anymore, but Liferay has a feature that lists the most recently changed documents.