Cloudera’s Hadoop developer certification is the most popular certification in the Big Data and Hadoop community. Having recently cleared the CCD-410 exam, I want to take the opportunity to share a few points that helped me prepare for the exam and, more importantly, learn Hadoop in a practical way.
Here they are:
1. Tom White’s book Hadoop: The Definitive Guide is an invaluable companion for clearing the exam. It may be the only book you need, as it will help you answer almost all the conceptual questions in the exam. Be sure to grab the 3rd edition (the latest to date), which covers YARN.
2. Don’t overlook the other Apache projects in Hadoop’s ecosystem, such as Hive, Pig, Oozie, Flume, and HBase. There will be questions testing your basic understanding of those topics. Refer to the related chapters in Tom White’s book. There are also very good YouTube videos and tutorials available on the web.
3. Understand how to use Sqoop. The best way to start may be to create a simple table in MySQL (or any database you choose) and import the data into HDFS as well as into Hive. Understand the different features of the Sqoop tool. Again, Tom White’s book can be used, as well as the Apache Sqoop user guide.
4. Understand the Hadoop fs shell commands used to manipulate files in HDFS.
5. To clear the exam you need to be hands-on in the basics of MapReduce programming, period. You will find a lot of questions in the CCD-410 exam asking about the outcome or possible result set of a given MapReduce code snippet. What you need to know and practice is how to convert the common SQL data access patterns into the MapReduce paradigm. There will also be questions testing your familiarity with the key classes used in the driver class and their methods (for example, the Job class and how it is used to submit a Hadoop job).
Tip: Create two simple text files with a few records each, similar to the standard emp and dept tables. Load the files into HDFS. Then develop and test your MapReduce programs to produce outputs similar to the following queries:
- Select empl_no, empl_name from emp;
- Select distinct dept_name from dept;
- Select empl_no, empl_name from emp where salary > 75000;
- Select empl_name, dept_no, salary from emp order by dept_no asc, salary desc;
- Select dept_no, count(*) from emp group by dept_no having count(*) > 1 order by dept_no desc;
- Select e.empl_name, d.dept_name from emp e join dept d on e.dept_no = d.dept_no;
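To get a feel for the pattern before writing a real Hadoop job, here is a minimal plain-Java sketch of how the group-by/having query above breaks down into map, shuffle, and reduce steps. It deliberately does not use the Hadoop API: the class name, the method names, and the record layout (empl_no,empl_name,dept_no,salary) are assumptions made purely for illustration, and the framework's shuffle is imitated with an in-memory map.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// In-memory imitation of:
//   select dept_no, count(*) from emp group by dept_no
//   having count(*) > 1 order by dept_no desc;
public class GroupByHavingSketch {

    // "Map" step: parse one record and emit dept_no as the key.
    // Assumed (hypothetical) layout: empl_no,empl_name,dept_no,salary
    static String mapToDeptNo(String line) {
        String[] fields = line.split(",");
        return fields[2].trim();
    }

    // "Shuffle" step: collect a 1 for every record under its key.
    // In Hadoop the framework does this grouping between map and reduce.
    static Map<String, List<Integer>> shuffle(List<String> lines) {
        Map<String, List<Integer>> grouped = new TreeMap<String, List<Integer>>();
        for (String line : lines) {
            grouped.computeIfAbsent(mapToDeptNo(line), k -> new ArrayList<Integer>()).add(1);
        }
        return grouped;
    }

    // "Reduce" step: sum the 1s per key (count(*)) and keep only the
    // groups whose count exceeds minCount (the HAVING clause).
    // Keys are held in descending string order to mimic ORDER BY ... DESC.
    static Map<String, Integer> reduce(Map<String, List<Integer>> grouped, int minCount) {
        Map<String, Integer> result =
            new TreeMap<String, Integer>(Comparator.<String>reverseOrder());
        for (Map.Entry<String, List<Integer>> entry : grouped.entrySet()) {
            int count = 0;
            for (int one : entry.getValue()) {
                count += one;
            }
            if (count > minCount) {
                result.put(entry.getKey(), count);
            }
        }
        return result;
    }

    static Map<String, Integer> run(List<String> lines, int minCount) {
        return reduce(shuffle(lines), minCount);
    }

    public static void main(String[] args) {
        // Made-up sample records, not real data.
        List<String> emp = Arrays.asList(
            "101,Asha,10,82000",
            "102,Ben,20,64000",
            "103,Chen,10,91000",
            "104,Dev,30,70000",
            "105,Eli,20,77000");
        System.out.println(run(emp, 1)); // prints {20=2, 10=2}
    }
}
```

Once this decomposition feels natural, the same three responsibilities move into a Mapper class, the framework's shuffle, and a Reducer class in an actual MapReduce program.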
6. You are expected to understand basic Java programming concepts. This is no sweat for people who regularly work in a Java environment, but for the rest of us a basic Java refresher course will be very handy. Pay particular attention to the following topics, which are very helpful in writing and understanding MapReduce code:
- Regular expressions
- String handling in Java
- Array processing
- Collections Framework
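As a quick refresher that touches all four topics at once, the following is a small sketch of a word count in plain Java, the kind of logic that typically ends up inside a mapper or reducer. The class name, method name, and sample sentences are made up for illustration.

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class WordCountRefresher {

    // Regular expression: one or more letters or digits form a word.
    private static final Pattern WORD = Pattern.compile("[A-Za-z0-9]+");

    // Walks an array of lines, normalises each string to lower case,
    // and tallies every word in a sorted map from the Collections Framework.
    static Map<String, Integer> countWords(String[] lines) {
        Map<String, Integer> counts = new TreeMap<String, Integer>();
        for (String line : lines) {
            Matcher matcher = WORD.matcher(line.toLowerCase(Locale.ROOT));
            while (matcher.find()) {
                counts.merge(matcher.group(), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        String[] lines = {
            "Hadoop stores data in HDFS.",
            "MapReduce reads data from HDFS"
        };
        // prints {data=2, from=1, hadoop=1, hdfs=2, in=1, mapreduce=1, reads=1, stores=1}
        System.out.println(countWords(lines));
    }
}
```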
7. Finally, don’t forget to refer to the Cloudera website for the latest updates, study guides and sample questions for the specific certification you are targeting.
Note that you can optionally buy a practice test from the Cloudera website. If you are well prepared and want to self-check your exam readiness, you may try this out (Disclaimer: I did it).
I also recommend that you go through the following article from Mark Gschwind’s BI Blog. The article gives you solid direction to jumpstart your preparation as well as your Hadoop learning.
All the best in your journey to learn Hadoop and get certified! Please share your experience and comments.
Tagged: CCD-410, certification, Cloudera, hadoop, Hadoop certification, Hadoop exam
Hi dude, very excellent info. This is the kind of info I like. Thank you so much.
Hadoop Training in Chennai
Hi,
It’s great info. I was confused about whether the exam includes questions on code snippets, Pig, Hive, etc. You clarified that for me, thanks. Will the ecosystem questions be general or in-depth?
Please provide some more detail on the pattern, such as how many chapters I have to focus on.
snthil@gmail.com
Thanks,
Senthilselvan.D
Hi Senthil,
Thanks for your question.
Ecosystem questions seem to test our knowledge of the overall concepts and usage patterns. Go through the sample questions on the Cloudera website and always check their latest course outline to ensure you get the latest information.
In my view, you need to cover Chapters 1 to 8 of Tom White’s book “Hadoop: The Definitive Guide” for the Hadoop questions.
Excellent write-up! Good for newbies who’re prowling the internet searching for where to get started!
Hi Pinaki,
Could you please share your email id? I have some questions which I wish to discuss offline. I’d highly appreciate your help, bro! Thanks!
Great tips for the CCD-410.
Thanks for the info, this is helpful.
Hello Pinaki,
Excellent info! Thanks much for your time and sharing the details!
I have just started preparing for the CCD-410 exam for Hadoop developers. Could you please share some sample questions or material on this? It would help me a great deal.
Here is my email id: rahul4usa@gmail.com
Thanks & Regards,
Rahul
This is great information to check before taking the exam.
If you have some sample questions and would like to share them, could you please send them to rajkumarsowmy@gmail.com
Thanks for this useful information.
Thanks for the promising, well-organized information, Pinaki!
Hi Pinaki, please send some sample questions to my mail id: saurabhkg.jnp@gmail.com
Most companies, especially India-based companies, prefer people who are certified by Cloudera. Everything you have explained is very useful to folks like me who are planning to get a Cloudera certification. Thanks for sharing your valuable experience.
Venu.k
Hadoop developer in Hyderabad
Hi,
This is very good info for learning Hadoop.
Thanks for sharing it with us.
Also, please notify me if you have some examples and questions about CCD.
sumitthakur040388@hotmail.com
Thanks & regards
Sumit
Thanks, folks, for your comments and responses. One request: let’s focus our discussion on “how to prepare / sharpen our skills” and not on the “sample questions”. Here, let’s talk about how we can prepare better and hone our Hadoop skills to get through the certification.
Hi,
I am currently working on an Insurance Policy Admin System for a service-based company. I have no hands-on experience in Hadoop and know just the basics of Java (coming from an Electronics background). I do use XML tools like Altova MapForce and XMLSpy, but on a very limited scale. In order to upgrade my skills I have attended classroom training for Hadoop and have also begun self-study. I would like to know: will it be possible to clear CCD-410 without having worked on this framework?
Hello guys, how do you register for the exam?
I took it in the first week of August and am happy/lucky to have emerged on the passing side (just). Code outcome prediction (which obviously ate up most of my 90 minutes) formed a reasonable chunk. It requires an unshakeable, hands-on understanding of how MapReduce functions (not just theoretical know-how), as every moment it challenges the fundamentals you have amassed over the last couple of months or so. The shell, Hive, and Sqoop questions also seemed beyond basic to me. Lastly, Tom White rocks! Hope this helps and all the best!
Hi, this gives me good direction. I hope I will clear the exam on the first attempt.
Looking forward to your feedback.
Hi, I am preparing for the CCD-410 and just wanted to clarify one point: are all the questions on Hadoop 2.0, or might we get questions on Hadoop 1.0 as well (i.e. JobTracker, TaskTracker, etc.)?
Thanks in advance
Great info and helpful for the certification. Thank you very much.