Thinking like a Data-Driven Guy for Dropbox

When I first used Dropbox from my Linux box, it was a shining moment for me. At that time, I was looking for exactly this kind of solution for the files that I always used to leave behind on my USB drive. For Linux users, many of whom love open source software, collaboration is an important issue, and Dropbox has saved my work many times, because the platform itself is a synonym for collaboration. This is one of the reasons why I love the platform.

The other reason why I love the platform is that they use my favorite programming language, Python, for the core development of the proprietary synchronization daemon. And in 2012, Guido van Rossum, the creator of the language, joined Dropbox’s payroll: just awesome!

So, I want to make my small contribution to the platform by writing down some ideas on how to improve it and the business itself. I will divide them into some key points:

  • Increase the frequency of posts about Data Science at Dropbox on the Tech blog
  • Improve user engagement on mobile devices using Localytics services
  • Hire Greg Nudelman as a consultant to improve Dropbox for Android, and work with the Mailbox team on an Android-based version
  • Build a high-class Data Science team to get better and more useful insights from Dropbox’s massive data sets
  • Improve marketing efforts using Inbound Marketing techniques focused on Facebook, Google Plus, LinkedIn, Twitter, blogging, ebooks, etc.

Continue reading “Thinking like a Data-Driven Guy for Dropbox”

DataStax Enterprise 3.0: A synonym for High Secure Real-Time Analytics

A few days ago, I had the pleasure of talking with two Apache Cassandra experts. The first was Edward Capriolo, a Hadoop System Administrator at Media6Degrees, organizer of the NYC Cassandra User Group and NYC NoSQL Meetups, author of the incredible “Cassandra High Performance Cookbook”, and one of the DataStax MVPs.

The second was Jonathan Ellis himself, DataStax’s Chief Technology Officer and co-founder, who also leads the Apache Cassandra project.

Continue reading “DataStax Enterprise 3.0: A synonym for High Secure Real-Time Analytics”

Why Red Hat need partnerships with Cloudera, MapR, Intel and OpenDNS

Yes, I know. I’m a slightly crazy young man, pitching all these ideas to Jim Whitehurst, the current Red Hat CEO, and his management team. But who knows? Perhaps some of these ideas are not so crazy and could be implemented. I will leave that responsibility to the board.

My ideas are focused on three key needs for many organizations and companies today:

  • Apache Hadoop: the de facto platform for Big Data Analytics, and its relationship with Cloud Computing
  • Internet Security: a serious problem today for companies, governments, and global organizations
  • Apache Hadoop security: a topic much discussed by customers, developers, and system engineers, which needs a solution right now

Continue reading “Why Red Hat need partnerships with Cloudera, MapR, Intel and OpenDNS”

Choosing a MPP database is incredibly hard

As the title says, choosing an enterprise-level Massively Parallel Processing (MPP) database is a big headache for every Data Science manager, basically because there are so many very good choices in the tech world.

Continue reading “Choosing a MPP database is incredibly hard”

Fighting Cybercrime with Splunk Security Analytics

There are many industries which are booming: Real Estate, Marketing Analytics, Retail, Recruiting Services, Big Data Analytics; but these are the good guys. There are other guys who are using their deep knowledge of security, hacking, cracking, and phishing to take advantage of the popularity of these industries, cut a big slice of the pie, and make money from it. A new kind of business has been born: Crime as a Service (CaaS).

Continue reading “Fighting Cybercrime with Splunk Security Analytics”

MSBA from NYU Stern: Born to shape Data Science Leaders

A few days ago, I sent a tweet that changed my perspective on Business Analytics. The message went to Anindya Ghose (@aghose), a professor at the well-known New York University Stern School of Business and co-director of the Center for Business Analytics at the same institution. He is an expert in Social Media Analytics, advertising effectiveness (one of my favorite research fields), the mobile advertising market, and a lot of other amazing stuff; you will be amazed by his commitment.

But he is not alone. Other brilliant minds are working together with him to create an amazing Master’s degree in Business Analytics. Who are they? I will mention just two: Sinan Aral, another research leader in Social Networks, Social Media and Digital Strategy, and a Microsoft Faculty Fellow, who wrote an amazing article about the power of influence in social marketing in the Harvard Business Review; and Professor Vasant Dhar, who is the Head of the Information Systems Group at the New York University Stern School of Business. Dr. Dhar’s research is focused on data-driven predictive analytics in Finance, Healthcare, and Social Phenomena.

Continue reading “MSBA from NYU Stern: Born to shape Data Science Leaders”

Why Real-Time Analytics matters

Many of you, my good Data Science fellows, have probably been hearing about Real-Time for several years now, but we are in the Information Era, in the years of Big Data, and changes happen so quickly that you need to adapt very fast to handle the big wave of information. The same thing is happening in Analytics: if you can answer smarter questions in seconds, you will be able to react more quickly to these changes, and that really matters in these hectic times, my dear friends.

Yesterday I was reading a great blog post by Derrick Harris, the well-known technology journalist at GigaOM, where he made some good points about Spark, the great technology being developed by the AMPLab at the University of California, Berkeley. But it’s not just Spark: there are other good pieces of technology which are disrupting the Analytics field for good. I will try to present some of my favorite platforms in this post, but I don’t want to repeat information, so I will write just a few notes and amazing quotes about each platform. Let’s begin.

Continue reading “Why Real-Time Analytics matters”

Tableau Software: Rewriting the Story of Data Visualization

Feeling like a Storyteller fan

When I was a kid, I sat down every afternoon at exactly 6:00 PM to watch another episode of a great series created by Jim Henson in 1988 called “The Storyteller”, where an old man (John Hurt) and his funny talking dog (Brian Henson) retold the best fables from around the world. I watched and listened with such enjoyment that time passed quickly, and I began counting the hours until the next episode the following day. John’s voice was very quiet and full of kindness, and every child at that time loved the stories told by the great storyteller.

You may be wondering why I began this way. There is a single answer: I get the same feeling when I see a great infographic or a data visualization created with great tools like R in combination with ggplot2, Python’s great matplotlib library, or the amazing Tableau platform, because for me, the work of a Data Scientist comes down to just one thing:

“To tell a great story behind numbers and facts, with amazing graphics that say more than simple words and sophisticated statistical methods. To make it simple: you have to be a great Storyteller like John.”

But how do you do that more quickly and easily? How do you build great data visualizations in a matter of seconds? As I wrote before: time, together with your mind, is the most important resource you have, and you have to use both wisely. So I have an answer for you, my friend: use Tableau Software. Now you may be asking the million-dollar question: WHY? Keep reading.

Continue reading “Tableau Software: Rewriting the Story of Data Visualization”

Data-Driven Games Development Rockstars

I have to say it, my friend: I’m an avid gamer. Big players like EA, Activision with its Call of Duty: Black Ops II, Ubisoft with its amazing Assassin’s Creed franchise, Blizzard with Diablo III, Konami with Metal Gear Rising, Microsoft with Halo 4, Bioware with its incredible Mass Effect 3, and many more great names are getting a lot of attention from gamers, but there is another kind of game capturing gamers’ minds: social and mobile games.

Today, some teams are disrupting the gaming industry with their novel ways of creating engaging and cool games. And how do you think companies like Wooga, Rovio, and Supercell are winning this hard game? Using game analytics. Yes, my friend, using cool things like A/B feature testing, customer analysis, MAU performance, DAU predictive analytics, and many more techniques.

Continue reading “Data-Driven Games Development Rockstars”

Data Scientists: the world need us

Some months ago, I wrote a post dedicated to new Data Scientists, giving my personal recommendations on several books that are pure gold, and great tools like Python, R, and Apache Hadoop. Today is a new day for this kind of professional, because the Harvard Business Review (HBR) has published a great article about the Data Scientist, written by Thomas H. Davenport and D.J. Patil. I think both did an incredible job with it; believe me, you should read it, you will not regret it. So, I want to dedicate these lines to the rising number of jobs with a shining title: “Data Scientist”. If you look today at any job board like LinkedIn, AOL Careers, Indeed, SimplyHired, Technology Ladders or Dice, and you do a quick search for this title, you will find more than 250 new open positions every day, searching only in the U.S. If you expand the search to more countries like the UK, Germany, Ireland, India, China, and the Netherlands, the numbers grow like complete madness.

Continue reading “Data Scientists: the world need us”