
Apache Lucene Development

Apache Lucene and Hadoop are open source projects from the Apache Software Foundation. Apache Lucene provides full-text indexing and search for applications, and is fast gaining popularity for implementing Internet search engines, applications with massive search-and-retrieve requirements, and local, single-site search. The Lucene API is independent of file format: it indexes text, so PDF, HTML, Microsoft Word and OpenDocument files can all be indexed once their text has been extracted. Together, Lucene, Apache Nutch and Apache Solr cover the full search stack: indexing and search libraries (Lucene), crawling and HTML parsing (Nutch), and a web-based search server (Solr). Related utilities include Compass, a Java search engine framework built on top of Lucene.
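To make the idea of full-text indexing concrete, here is a minimal sketch of an inverted index, the general data structure behind search libraries of this kind. It is written in plain Python and does not use the Lucene API; the `TinyIndex` class, the `tokenize` helper and the sample document names are all hypothetical, chosen only for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


class TinyIndex:
    """Toy in-memory inverted index: term -> set of document ids."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text):
        # The index only ever sees plain text; extracting that text from
        # PDF, HTML or Word files is a separate step done beforehand.
        for term in tokenize(text):
            self.postings[term].add(doc_id)

    def search(self, query):
        """Return the sorted ids of documents containing every query term."""
        terms = tokenize(query)
        if not terms:
            return []
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return sorted(result)


index = TinyIndex()
index.add("report.pdf", "Quarterly search engine usage report")
index.add("home.html", "Welcome to our search portal")
index.add("notes.doc", "Engine maintenance notes")

print(index.search("search engine"))  # ['report.pdf']
```

A production library adds much more on top of this (analyzers, relevance scoring, on-disk segments), but the lookup from term to matching documents is the core that makes searching fast.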

Hadoop is the popular open-source implementation of MapReduce, a programming model designed for deep analysis and transformation of very large data sets. The framework automatically parallelizes computing tasks across a cluster. A Hadoop cluster is typically used with other frameworks and platforms to support applications that work with thousands of nodes and petabytes of data. Hadoop was inspired by Google's MapReduce and the Google File System (GFS). At its core is the MapReduce engine, which processes data in two steps: a map step that filters and transforms input records into key-value pairs, and a reduce step that aggregates the values for each key. A related project is Apache Mahout, which aims to produce free implementations of distributed machine learning algorithms on the Hadoop platform.
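The MapReduce model can be sketched in a few lines of plain Python. The example below is a single-process illustration of the map, shuffle and reduce phases using the classic word count job; it does not use Hadoop itself, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `run_job`) are hypothetical.

```python
from collections import defaultdict


def map_phase(record):
    """Map: emit a (word, 1) pair for every word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def run_job(records):
    # Each record could be mapped on a different node; here we loop locally.
    pairs = [pair for record in records for pair in map_phase(record)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = run_job(["big data big clusters", "data nodes"])
print(counts)  # {'big': 2, 'data': 2, 'clusters': 1, 'nodes': 1}
```

Because each call to `map_phase` depends only on its own record, and each call to `reduce_phase` only on its own key, a framework is free to spread both phases across many machines.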

Apache Lucene development, combined with the Hadoop platform, can be used to build powerful, robust search engines for just about any application, up to and including search across the entire Internet. Apache Nutch builds on both projects: it uses Lucene for indexing and search, and Hadoop, which began as part of Nutch, for distributed processing.

Silicus Apache Lucene and Hadoop Expertise and Knowledge

Silicus has a team of Lucene developers who have worked on Lucene implementation projects for applications in medical business intelligence. We have a good understanding of how Lucene works and of the intricacies of programming with it. We have the knowledge and expertise to develop Lucene applications and to extend them, together with web servers, into Solr and Nutch applications.

We also have experience in developing Hadoop applications. Our Lucene and Hadoop application development expertise includes:

Apache Lucene, Nutch and Solr applications

  • Usage of Lucene.NET, a class-per-class, API-per-API and algorithmic port of the Java Lucene search engine to C# and the Microsoft .NET Framework
  • Usage of Lucene web service which is a REST-compliant Java servlet capable of searching and managing multiple Lucene indices via the web
  • Knowledge of PyLucene, a Python API for accessing Java Lucene

Hadoop

  • Knowledge and understanding of the Hadoop core
  • Apache Pig, a high-level query language for reporting on Hadoop
  • Understanding of the workings of the MapReduce (MR) algorithm
  • Knowledge of the Hadoop Distributed File System (HDFS) and its inner workings
  • Eclipse configuration and installation for Hadoop development
  • Usage of the Amazon Elastic Compute Cloud (EC2) platform with MapReduce and the Hadoop framework
  • Experience using the Bixo web mining toolkit with Hadoop for building workflows
  • Experience using the Cascading API for Hadoop data processing workflows

Apache Lucene and Hadoop Solutions

We focus on the design and development of scalable, high-performance search solutions.

Web Mining

  • Extracting and processing web data
  • Business intelligence and competitive intelligence solutions

Vertical searches

  • Product and catalogue search applications using the Hadoop streaming API with C++, Perl and Python tools
  • Development of advertisement networks and recommender systems based on behavioral targeting using the Hadoop framework
  • Search engine development
  • Web site analytics solutions
  • Enterprise search, natural language processing, and text analytics solutions
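As an illustration of the Hadoop streaming approach mentioned in the product and catalogue search item above, the sketch below shows a Python mapper and reducer that build a simple term-to-product index. Streaming jobs exchange tab-separated key/value lines over standard input and output, and the framework sorts the mapper output by key before it reaches the reducer. The input format (`product_id<TAB>description`) and the sample catalogue are assumptions made for this example, and the job is run locally rather than on a cluster.

```python
from itertools import groupby


def mapper(lines):
    """Emit one 'term<TAB>product_id' line per distinct term in each record.

    Assumed input format (hypothetical): 'product_id<TAB>description'.
    """
    for line in lines:
        product_id, _, description = line.rstrip("\n").partition("\t")
        for term in sorted(set(description.lower().split())):
            yield f"{term}\t{product_id}"


def reducer(sorted_lines):
    """Collapse key-sorted 'term<TAB>product_id' lines into one line per term."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for term, group in groupby(pairs, key=lambda pair: pair[0]):
        product_ids = sorted({product_id for _, product_id in group})
        yield f"{term}\t{','.join(product_ids)}"


catalogue = ["p1\tred cotton shirt", "p2\tblue cotton jeans"]
# Hadoop Streaming sorts mapper output by key before the reducer sees it;
# sorted() stands in for that step in this local run.
index_lines = list(reducer(sorted(mapper(catalogue))))
print("\n".join(index_lines))
```

On a real cluster the mapper and reducer would normally live in separate scripts that read `sys.stdin` and print to standard output, and would be passed to the Hadoop streaming jar through its `-mapper` and `-reducer` options.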

Silicus can provide you with an experienced team of Solr, Hadoop or Nutch developers to address your specific resource requirements in a typical offshore staffing model. Silicus can also provide a dedicated Lucene, Nutch, Solr or Hadoop application development center to suit your long-term search application development requirements.

The advantages of using Apache Lucene and Hadoop services from Silicus are:

  • Track record in Apache Lucene implementations
  • Solution development and delivery expertise
  • Process and quality (CMMI Level 3 and ISO 9001/27001 certified)
  • Flexible engagement models
Try us for a pilot

Are you looking to outsource projects in LAMP technologies, or do you need resources with LAMP skills? Just fill in the form below and we will get in touch with you.

© Copyright 2000-2009 Silicus Technologies, Inc.