Friday, May 23, 2014

Verified Certificate after Completed Full Course

Below in my verified certificate, I got this after completed full course including exercises , final project and  exit interview in which Udacity asked question about Hadoop & MapReduce and my final project.

Sunday, May 18, 2014

Udacity Hadoop Course–Good Insight About Hadoop & MapReduce

Today I completed Introduction to Hadoop & MapReduce course from udacity. It is very useful course to get insight about mapreduce & hadoop. I  like way they are teaching . I specially like udacity way of asking questions and quizzes in the middle of lecture.
Udacity is offering both free & paid versions of  hadoop course.Course is taught by Cloudera professionals Sarah Sproehnle (Vice President of Educational Services at Cloudera) and Ian Wrigley(Senior Curriculum Manager at Cloudera).Full content of course is accessible even in free version but in paid version you will get special attention, you will get one coach for your assistance and at the end of the course you can get verified certificate if you will able to meet their expectation. Before getting certificate there is exit interview in which they will verify that you created that project or not. The course is consist of total four lectures and each lecture has multiple small questions and exercises. Following are lectures topics
  • Introduction to Hadoop, Big Data, Data Science & MapReduce
  • The Basics of HDFS, MapReduce and Hadoop cluster.
  • Writing a MapReduce program to answer questions about data.
  • MapReduce design patterns. (I like the most)
It is better to implement all exercises by your self  as I did, but for  an overview or getting an idea about solution. you can also see solution of first three lectures exercises from below link
I also uploaded some solutions for lectures four exercise at githup, Might be It could helpful for you.
Final project is also consist of four challenging questions that you have to do it by your self to learn and get certificate. I did it and it’s your turn to challenge your self.
Ohh one more challenging point, may be you need to learn python like me because this course is only available in python.

Wednesday, October 10, 2012

PDFBOX Merging Multiple PDF

PDFBox is providing excellent support for merging multiple PDF files into one. You just need to add following jars in your project and use below code to merge multiple pdfs 1. PDFBox.jar 2. apache-common-logging Ref:
     * SyntaxHighlighter
import java.util.UUID;

import org.apache.pdfbox.exceptions.COSVisitorException;
import org.apache.pdfbox.util.PDFMergerUtility;

public class PDFMergeTest {
 public static void main(String args[]) throws COSVisitorException, IOException{
  PDFMergerUtility mergePdf = new PDFMergerUtility();
  String folder="D:\\sample";
  String numberOfFiles[]=getFiles(folder);
  for(int i = 0; i < numberOfFiles.length; i++){
   mergePdf.addSource(folder + File.separator + numberOfFiles[i]);
  mergePdf.setDestinationFileName(folder + File.separator + "final.pdf");

 private static String[] getFiles(String folder) throws IOException
  File _folder = new File(folder);
  String[] filesInFolder;   

   filesInFolder = _folder.list();
   return filesInFolder;
   throw new IOException("Path is not a directory");