Sunday, October 6, 2013

Since I'm heading to Google DevFest next week I thought it would be a good idea to at least get some working knowledge of AngularJS.  AngularJS has a number of dependencies so I created a clean copy of Ubuntu 13.04 server to install it.
Now it seems that the default NodeJS that is packaged with the distro wasn't sufficient to install some of the node packages so it was off to NodeJS to download a binary tar copy of it.

Tried a few  AngularJS samples from GIT before settling on instructions on

http://www.sitepoint.com/kickstart-your-angularjs-development-with-yeoman-grunt-and-bower/
https://github.com/yeoman/generator-angular

The instructions were sufficient to get the skeleton working.

Looking forward to doing more than Hello Worlds.

Sunday, September 29, 2013

Setting up Nutch (1.7) and Solr (4.4) - Quick Start


I blog this so that anyone(mainly me- the guy with very short term memory), can install Nutch and Solr and get it working in the shortest amount of time and fuss.


I'm using Nutch 1.7 and Solr 4.4.

Make sure you have installed Java and have set JAVA_HOME.

I am using Oracle Java on Ubuntu server.



 1.

Download binares:

My nearby mirrors:
http://mirror.nus.edu.sg/apache/nutch/1.7/apache-nutch-1.7-bin.tar.gz
http://mirror.nus.edu.sg/apache/lucene/solr/4.4.0/solr-4.4.0.tgz


2.
 Unpack nutch and solr so that you will have

~/apache-nutch-1.7
 ~/solr-4.4.0

Copy the schema for solr4 from Nutch to Solr directory:

 cp apache-nutch-1.7/conf/schema-solr4.xml ~/solr-4.4.0/example/solr/collection1/conf/schema.xml



3.
Edit the scheme.xml that you have just copied to solr directory.

vi ~/solr-4.4.0/example/solr/collection1/conf/schema.xml



Add additional line in the name field in

      
 
  <  field name="_version_" type="long" indexed="true" stored="true" multiValued="false"/   >



Such that it will look like





4.
Start Solr.

cd  ~/solr-4.4.0/example


java -jar start.jar


Check if you can load the administrative page:


http://:8983/solr/#/




5. Start crawling.

Create a text file with one or a list of URLs , one per line in ~/apache-nutch-1.7/url/seed.txt


Or use the DMOZ example in the Nutch documentation.



6. execute:

bin/nutch crawl urls/seed.txt -solr http://localhost:8983/solr -depth 3 -topN 50










Saturday, September 7, 2013

PhantomJS and Ubuntu Server



Downloaded PhantomJS on new LAMP install of 12.04 VM .

Encountered an error:
"error while loading shared libraries : libfontconfig.so.1

It won't run without an additional package :

apt-get install libfontconfig.

Problem solved

Sunday, March 24, 2013

Virtualbox: Unable to locate my USB devices


Upgraded my VirtualBox on Win8 and encountered an issue where I could not locate my USB devices to plug into the guest. The tab just shows no device.

How I solved it:

Remove virtualbox extension when running as administrator File->Preference->Extensions
Uninstall and reinstallation of Virtualbox,
Add extensions again.

Fixed.