Posts

Showing posts from March, 2020

AWS :: AWS GLUE :: Creating ETL in cloud

Image
For any type of consultation, query or doubt. You may contact to the following: (+91) 9804 436 193 debabrataguha20@gmail.com  and join the group  https://www.facebook.com/groups/331231161093613/ AWS Glue is a cloud service that prepares data for analysis through automated extract, transform and load (ETL) processes. Let’s jump directly into some ETL examples by handling some small sample files. 1)   What is Parquet files: What is Parquet? Parquet, an open source file format for Hadoop. Parquet stores nested data structures in a flat columnar format. Compared to a traditional approach where data is stored in row-oriented approach, parquet is more efficient in terms of storage and performance. Why Parquet? Parquet stores binary data in a column-oriented way, where the values of each column are organized so that they are all adjacent, enabling better compression. It is especially good for queries which read particular columns from a “wide” (with many...

Java (Chapter 2.2) : Nested IF

For any type of consultation, query or doubt. You may contact to the following: (+91) 9804 436 193 debabrataguha20@gmail.com  and join the group  https://www.facebook.com/groups/331231161093613/ Hope the basic understanding of conditional statement is clear now. But if you are starting your java learning from this chapter only, I will request you to go through 1 st 2 chapters and solve the exercises. Then only come to this chapter. So let’s go to a program which we discussed earlier in the previous chapter ( Starting IF ). It will be like, user will give the month-number as input and program will return the number of days as an output. Input                     output                  why? 1                 ...