Kimball Forum

How to implement a data warehousing solution for Google Analytics data?

2 posters


Post  nkaur301 Tue May 18, 2010 12:15 pm

I have clickstream data such as referring URL, top landing pages, and top exit pages, and metrics such as pageviews, number of visits,
and bounces, all in Google Analytics. There is no database yet where this information is stored. I am required to build a data
warehouse from scratch (which I believe is known as a webhouse) from this data, so I need to extract data from Google Analytics and
load it into a warehouse on an automated daily basis. My questions are:

1) Is it possible? The data grows every day (some in terms of metrics or measures such as visits, and some in terms of new referring
sites), so how would the process of loading the warehouse work?

2) What ETL tool would help me achieve this? I believe Pentaho has a way to pull data out of Google Analytics. Has anyone used
it? How does that process work?

3) How does Google Analytics interface with Pentaho, and in what ways can you use Analytics features right inside Pentaho?

Any references or links would be appreciated, in addition to answers.



Re: How to implement a data warehousing solution for Google Analytics data?

Post  ngalemmo Wed May 19, 2010 11:03 am

As far as question #1 goes, sure, of course it is possible. I am not familiar with what Google provides, but from what you describe, it sounds like aggregate data. I've developed clickstream warehouses that maintained individual page views, with fact tables in the many billions of rows, without significant challenges and with good performance.
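To make the daily load in question #1 concrete, here is a minimal sketch in Python using the standard-library sqlite3 module. Everything in it is an assumption made for illustration: the table names (dim_referrer, fact_daily_traffic), the grain of one row per day per referring site, and the sample numbers. It shows one common way to handle both kinds of growth mentioned in the question: new referring sites get a new dimension row, and each day's metrics are replaced as a unit so the job can be rerun safely.

```python
import sqlite3

def create_schema(conn):
    # Hypothetical star schema: one dimension, one daily aggregate fact table.
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS dim_referrer (
            referrer_key INTEGER PRIMARY KEY AUTOINCREMENT,
            referrer_url TEXT NOT NULL UNIQUE
        );
        CREATE TABLE IF NOT EXISTS fact_daily_traffic (
            date_key     TEXT    NOT NULL,
            referrer_key INTEGER NOT NULL REFERENCES dim_referrer(referrer_key),
            visits       INTEGER NOT NULL,
            pageviews    INTEGER NOT NULL,
            bounces      INTEGER NOT NULL,
            PRIMARY KEY (date_key, referrer_key)
        );
    """)

def referrer_key(conn, url):
    # Add the referring site if it has not been seen before, then return its key.
    conn.execute(
        "INSERT OR IGNORE INTO dim_referrer (referrer_url) VALUES (?)", (url,)
    )
    row = conn.execute(
        "SELECT referrer_key FROM dim_referrer WHERE referrer_url = ?", (url,)
    ).fetchone()
    return row[0]

def load_day(conn, date_key, rows):
    # Replace the whole day inside one transaction so a rerun cannot double count.
    with conn:
        conn.execute(
            "DELETE FROM fact_daily_traffic WHERE date_key = ?", (date_key,)
        )
        for url, visits, pageviews, bounces in rows:
            conn.execute(
                "INSERT INTO fact_daily_traffic VALUES (?, ?, ?, ?, ?)",
                (date_key, referrer_key(conn, url), visits, pageviews, bounces),
            )

# Made-up sample data: two days, with a new referring site on the second day.
conn = sqlite3.connect(":memory:")
create_schema(conn)
load_day(conn, "2010-05-17", [("google.com", 120, 340, 45)])
day_two = [("google.com", 130, 360, 50), ("bing.com", 12, 20, 7)]
load_day(conn, "2010-05-18", day_two)
load_day(conn, "2010-05-18", day_two)  # rerunning the same day is harmless
```

Deleting and reinserting a day is only one option. It suits aggregate data where a day's totals may be restated. A warehouse that keeps individual page views would more likely append new rows only.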

As far as ETL tools go, there are a lot of products out there, most of them good. Choose one that you can afford and that provides the functionality you need.

As far as interfacing goes, how does Google provide the data? If it is just a download, where they provide a flat file or XML file, most ETL tools should be able to handle it.
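If the data does arrive as a flat file, the extract step can be quite small. The sketch below assumes a CSV export with the columns date, referrer, visits, pageviews, and bounces. That layout and the sample values are hypothetical, not Google's actual export format, so the column names would need to be adjusted to whatever the real file contains.

```python
import csv
import io

# Made-up sample export; a real file would be read from disk instead.
SAMPLE = (
    "date,referrer,visits,pageviews,bounces\n"
    '2010-05-18,Google.com,130,"1,360",50\n'
    "2010-05-18,bing.com,12,20,7\n"
)

def to_int(value):
    # Exports sometimes format numbers with thousands separators, e.g. "1,360".
    return int(value.replace(",", "").strip())

def parse_export(text):
    # Turn the CSV text into typed tuples ready to load into a fact table.
    reader = csv.DictReader(io.StringIO(text))
    rows = []
    for rec in reader:
        rows.append((
            rec["date"],
            rec["referrer"].strip().lower(),  # normalize so lookups match
            to_int(rec["visits"]),
            to_int(rec["pageviews"]),
            to_int(rec["bounces"]),
        ))
    return rows

rows = parse_export(SAMPLE)
```

An XML download would follow the same pattern with a different parser. Either way the output is a list of plain rows that any ETL tool or load script can take from there.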

