Warning: base64_decode() expects parameter 1 to be string, array given in /home/eheupel/eclecticechoes.com/wp-content/plugins/askapache-google-404/askapache-google-404.php on line 156
Computers » Eclectic Echoes
Skip to content

Category Archives: Computers

Google Insights – Pop Culture vs. Environmental Conservation

Roger Harris of Saving Species, put up an interesting post that seems to provides support for the hypothesis that most people care more about the latest pop star, fashion, Hollywood marriage / divorce or some other trivial fact from pop culture than about conservation issues. He used a plot showing Google searches (as a proxy for “Interest”) for “biodiversity” versus various pop culture terms:”Justin Bieber”, “Lady Gaga”, “Britney Spears”. It was unsurprising that the pop culture terms were searched more often than “biodiversity”, but it was a bit surprising how much more. As he points out:

Relative to searches for Justin Bieber, et al., the number of searches for biodiversity is essentially zero. Even if we take Britney Spears, whose star is fading, searches for her name during 2011 outnumber those of biodiversity by about 23 to 1. That is, for every person who searches for biodiversity, 23 are looking for information about Britney Spears.

Pretty disheartening result, I must say, at least on the face of it. Roger is a bit concerned about the result as well.

But what does it mean? Do people care more about one rather mediocre pop singer than they do about the dwindling variety of life on the planet?

I think it’s a an interesting exercise, and the results fit what we might expect, but on thinking about it, I would not give the results much weight in answering any questions save “What term is searched more?”

I was a little concerned about the biodiversity search term as a representative of “conservation”, and the acceptance of this search as, shall we say, a positive interest equal across all possible search terms. How many people do we expect to search for “biodiversity” compared to searching for “Britney Spears” and how do people search for those two conceptually very different terms?

Since he started with the argument of people caring more about pop culture more than conservation issues, I would have liked to see a comparison using more than just the term “biodiversity”. Biodiversity may be the heart of conservation, but I think it may not be the first term to come to most peoples’ minds when thinking about conservation. It is a relatively recent term, and while widespread in science, conservation and management communities now, is it as widespread in the public? And perhaps more to the point, is it understood by the public that biodiversity is the core of conservation? Perhaps it would be more representative of the spirit of the comparison to use a series of conservation oriented terms such as “biodiversity”, “endangered species”, “wildlife conservation”, etc., instead of just the term “biodiversity”.

The graph above (and all to follow) begin at January 2010 and extend to March 2012. The results are normalized to the highest data point (which becomes 100). Pop culture again has far more searches than a collection of conservation related terms.

Maybe I am wrong, but I think the primary group of searchers for “biodiversity” are likely students. Someone looking for an answer for a homework question for 6th grade science class or studying for their finals in Bio101 in college. What would a student search for? To me the most logical searches would be “biodiversity definition”, “biodiversity examples”, “measuring biodiversity” and plain old “biodiversity”. Other people besides students would surely search for biodiversity, but I think that it’s a term that you really only search for once or twice. For most people it has an answer. “Biodiversity is …”, “Biodiversity is measured by ….”, etc. Sure, there are a small group of people, most likely people in ecology or conservation, that may search biodiversity heavily as a keyword for news items, etc., but I believe this would be a very small group.

Using Google Insights again, we can check the context of the searches for biodiversity:


What about pop culture references? Here I think there is far greater diversity of major searches. Of course, I am not well tuned to pop culture, so my thinking may be biased, but I see least three major, specialized search groups. First are the groupies searching for the latest information on their pop idols, just in case the sites they frequent aren’t up on the absolute latest gossip. This group is likely responsible for the greatest “per capita” search effort. I know 2 people who search for information on Lady Gaga two to three times daily. The second group of searchers I can see would be the “normal” fan base. They probably search a couple of times a month to perhaps daily. They are looking for latest concert announcements, lyrics to the new songs, and perhaps a bit of gossip. The third group would be folks, like me, who have (ok, had) no idea who some of these pop culture people are (yes, I admit, I had to Google both Justin Bieber and Lady Gaga), but are prompted to find out. I have no clue what a groupie might search for, but I can see the wider, normal, fan base searching for “lyrics”, “concert”, and similar terms. Those of us that are, shall we say pop-uncultured, the most normal search would be for the name only. For this last group there is interest in the search term, but often it is only to figure out of what it is that we are ignorant. I will readily admit, my interest in Justin Bieber, Lady Gaga, et al. was completely extinguished as soon as I saw the search results. Those first two groups, though, will search repeatedly on the same term.

Using Google Insights to check the context of the searches for (ulp!) Justin Bieber:


Google has equated a search as being equivalent to interest in the search term. I think it is logical to use interest defined this way to compare terms from similar categories. Comparing a pop star to a politician and a public scientist is a fair comparison. Comparing philosophies such as capitalism and socialism is a fair use of interest through Google searches. In the case of biodiversity, comparing a similar idea, such as ecosystem, would be a logical use. Just for fun I ran a few Google Insights comparisons keeping the search terms from similar categories:

Compare “Capitalism” and “Socialism”

How about comparing “biodiversity”, “ecosystem”, “genetic diversity”, “species diversity”

Notice the cycle for both “biodiversity” and “ecosystem”? I think this supports my hypothesis about the major search groups for biodiversity being students. There appears to be a strong periodicity that correlates to the school year. Low “interest” July and August, then sharply higher September through the end of November. December break? Sharp but short decline. January to April high again. May? Even higher (finals anyone?) then trailing off for June as the trimester and K-12 schools taper on out. Fits my hypothesis, but really need to look through a longer time series, and compare the pattern to other searches that should and should not be correlated with school activity. Is there a general decline in Google searches in the summer and Christmas holidays because of people spending more time away from the computer? Does that alone explain timing and magnitude of the pattern seen.

Finally comparing “Bieber” : “Obama” : “Santorum” : “Hawking” : “Darwin”

While I think comparing the number of searches for terms of similar category (e.g. concept, person, activity, place etc.) is a valid way to compare some level of interest, I think it’s getting on shaky ground to compare interest in an idea such as biodiversity, with a pop star, using the number of searches conducted on Google. It’s a situation of comparing apples and broccoli. It seems to me that the underlying assumption that the number of searches for a term is an equivalent measure of interest for any possible search term, allowing comparisons across broad categories is flawed. But then, that is often a tricky issue with proxies—understanding the assumptions and what they mean for where the proxies do and do not work.

Where I do think this is an interesting and potentially useful tool is sort of at the interface of Search Engine Optimization (SEO) and outreach. For an outreach organization understanding how people search for the concepts you provide outreach for is important. Often the terminology used by people who are experts on a subject is very different from terminology used by those who are familiar with the subject, which is very different from the terminology used by those who are entirely unfamiliar with the subject. Understanding what terms are often used by the different groups and how they search for them allows an outreach group to craft their outreach so that it is more discoverable for their target audience and allows better communication (& hopefully positive impacts) with that group. This is the type of work we used to do with large focus groups and surveys when I worked in industry. While I think it’s still a good idea to use either a small focus group or survey to get seed data, with Google Insights online the same results could potentially be accomplished with carefully planned Google Insights mining alone.

Guest Blogging Fun

The past few weeks turned out to be pretty busy for me online (at least compared to the past oh, 24 months), though from this site alone, it may be hard to tell. Since much of my posting has been at other places I thought a quick summary of the months posts (and links) would be helpful for those interested:

  • Sergeant Major (Abudefduf saxatilis) – A post at Larval Images about one of my favorite ecosystems and one of the great juvenile fish that are commonly found there.
  • Budget Hacking – A post here about the importance of NOAA for the myriad jobs they do, many of which are important for public safetly and economic security in addition to research. A post that I feel is a very important read, especially as the 2012 budget fight may still cut NOAA deeply, including the satellite’s needed to track and predict severe weather events as well as we do. Without NOAA’s work I think it is safe to say the Alabama death toll would have definitely  been significantly higher.
  • Gulf of Mexico Dolphin Mortality Event – Posted as Scientist in Residence at Deep Sea News – in which I use the data from NOAA to take a more slightly more detailed look at deaths of dolphins since the oil spill in the Gulf, and explain the box and whisker plot.
  • Dolphin Whiskers – now only Babies – published a few days later here, to address the concern that there is a higher that normal number of babies washing ashore, but the graph, as presented by NOAA and in the MSM, does not really support that claim. So again come out the box and whisker plots.
  • My ‘Seascape of Fear’ – A second posting as  Deep Sea News Scientist in Residence, I discuss my recent trip to Belize as a teaching assistant for a coral reef fish ecology class and the arrival in Belize of the highly invasive Lionfish.
  • How does a floating plastic duckie end up where it does? – A guest Blog post at Scientific American Part of a four post series on drifting junk in the oceans and how, sometimes, they can help us explore and learn more about the ocean currents. Other posts in the series include a review by Lindsey Hoshaw of the book Moby Duck (I’ll post my own review here later, I liked the book quite a bit more than Lindsey), an interview by David Manly with Moby Duck author Donovan Hohn, and a Matthew Garcia review of Dr. Curtis Ebbesmeyer’s bookFlotsametrics and the Floating World about tracing accidental drifters and the information they can give us.
  • Is It Time to Relax Fishing Regulations? – Another Scientist in Residence post at DSN, this was a response to fisheries biologist Ray Hilborn’s recent op-ed in the New York Times advocating a relaxation of the current fishing regulations. (Enric Sala, Peter Singer, Daniel Pauly and Mark Kurlansky all replied to the paper.)
  • Finally, Reflections, posted here, in which I examine where I have been, where I am and the options going forward.

Hopefully in the next few days we will have a guest posting or two here by Johann. Discussing some of his recent adventures and science from his point of view.

In addition to the postings there are several new YouTube videos I uploaded in the past weeks, mostly of the underwater variety.

Reflections

As I begin the home stretch for my MSc in oceanography I have been looking very hard at the job markets and the world of research science out there. It is not pretty right now, but then, with the help of a loving and supportive family we’ve weathered this type of climate before. I know we’ll find our way through this and come out the other side, happy. Because that is just what we do Tammy and Johann and me. We cling tight to what matters most – each other and our closest family and friends.

One of the things I have to keep coming back to is “What do I want to be when I grow up”

But that has never been a simple question. When I was young I wanted to be a scientist and an explorer. I was excited and my imagination ignited by archeology, marine biology and the space program. I devoured national geographic magazines, Wild Kingdom and Jaques Cousteau’s specials. I remember reading and re-reading the articles by Dr. Eugenie Scott on the amazing fish of the Red Sea and sharks in general. I remember reading about Dr. Sylvia Earle’s descent to 1250m in a hard suit and her Tektite mission. I know it may sound crazy, but one of the highlights of my brief science career so far was to dive on the Aquarius site as a science diver… the descendant of Tektite, it was, part way to an old dream come true – to live and work in an underwater habitat studying the seas for hours and hours at a time. One day I still hope to make that dream come true.

Neemo9 aquarius

Science diver approaching the NOAA/UNCW Aquarius Habitat off Key Largo, Florida

But my path took a strange turn and instead of going to Woods Hole or Scripts or Harbor Branch, I ended up in the Army working as an advanced communications specialist using, trouble shooting and fixing just about every type of communication technology in the Army, but specializing in satellite systems. It could be a challenging job, especially in remote combat deployments, but it really didn’t make me stretch. I spent my spare time reading and improving my animation skills as a form of entertainment. In Central America I learned to scuba dive and spent as much time on Roatan Island as I could, doing 3-4 dives a day. The more I dove, the more I needed to learn about the fish and invertebrates I was seeing. I subscribed to several diving magazines and bought every marine biology book my scuba instructor could get from the States. I invited my future wife to meet me in Roatan, unfortunately she declined.

IZE Sunset

Sunset on the Meso-American Reef. Copyright E. Heupel

After the army I worked in the computer industry in engineering and eventually web development until the bubble burst. When that happened I returned to school, studying computer systems and graphic design. Unfortunately returning to school also revealed that I had a memory issue. Tammy knew before, but I denied it of course. Unfortunately the tricks I had learned to use on the job, didn’t translate well to the academic environment. I struggled to find a new way of learning and studying, while my grades sank, eventually forcing me to admit defeat temporarily as I withdrew from school.

Fast forward to five years ago when I took advantage of an opportunity to again return to school. This time in Oceanography. I had since learned to deal with my memory issues with new strategies. I started slow, with only two classes, but soon took on a full course load completing the four year degree in three and a half years with a job, a family and still managed a 3.5 GPA. My old skills in electronics, optics, video production and web design all served me well working in labs and earning me opportunities to work with Remotely Operated Vehicles. At the end of undergraduate I knew I needed to take this further, I needed to revisit my old dream of being a scientist working in, on and under the ocean.

Motley Crew

The motley crew of the SHRMP 2010 habitat monitoring program mission. Copyright E. Heupel

I was accepted to the graduate program and began learning more about sustainable fisheries and GIS than I thought was possible to learn (and yet I have still only learned a spall portion) . It has been a good run, but now it is almost over. I want to go on further, but I know I need a change in direction. My interests lay more with larval and juvenile marine organisms and their ecosystem roles (besides the stock answer I get from many: “as food” – too damn easy), or in the ecology of deep sea and mangroves and with invasive species in connection to any of the previous. I have at least a hundred questions banging around in my head, and I am loathe to even try to pick only one and say -> This is it.

Juvenile Sergeant Major

One of my favorite fish of the mangroves is the juvenile sergeant major. Very cute, shy and nervous - darting constantly around the patch of mangrove they call home. Copyright E. Heupel

More than that there is the question of what good is a PhD, and is the cost too high to justify. I have put my family through a lot already. It has been financially very hard, and we have done without a lot. I have been fortunate that this program knows me, and knows the type of contribution I can make, and also understands that my family is the most important thing in my life. I will never be one of those scientists (or PhD students) so driven by the research that I sacrifice my family (which I have seem too much of in the past 5 years). Driven yes. If I had a spare $10,000 right now I would be on a plane to Belize to chase down one of my burning questions on invasive species and My Seascape of Fear (actually budgeted with no salary it a hair over 10,000). But I’m not going to throw my family under the bus to get there.

Which brings it once again back to what I want to do with the degree. I would like to be able to design and conduct my own research, which I would need a PhD for. I enjoy teaching small to medium size classes, as long as there is at least one or two kids turned on to learning. At a University or college a PhD is generally the ticket for admission to that. At community colleges, a PhD can be required, or a hinderance.

As for the most singlehandedly enjoyable thing I have done in the past 5 years – it would be the outreach efforts at Aquarius. Doing the science, putting on a live show, broadcasting it to kids in their classrooms and online – both doing science and helping to communicate it to a larger audience. That was for me a real rush. Many of the people involved in that team effort did not have PhD’s, but then again many did. I enjoyed the fact that we were communicating conservation, physics and biology directly and passionately to an audience eager to learn.

A Magnificently Motley Crew

The marvelous crew of the Aquarius 2010 If Reefs Could Talk mission. Copyright E. Heupel

If I stopped right now, my ideal job, would be either as a freelance science communicator specializing in video and online production or it would be with one of the NURC centers or a similar scientific research organization or NGO where I can put my myriad skills to work – oceanography, diver, science outreach, video, animation, web, database, photography (normal and U/W) and ROV pilot (in training right now). But… likely I would not be able to do my own research, which is important to me.

If I were 23 and single, the answer for me would be easy – go for the PhD and study larval and juvenile ecology issues, especially in the mangroves and deep sea. But I’m not 23, or single. And I wouldn’t trade my family for anything, but it does mean I need to figure the 4-6 years of making (if I’m lucky) $30,000/yr while working very long hours into the equation.

Dolphin Whiskers – now only Babies

Para-sight recently wrote a post at DSN urging caution in assigning blame to the BP oilspill as the cause of recent cetacean deaths along the Gulf Coast. To try and address one of the issues he raised, I wrote a guest post there using the raw data from NOAA to quickly try and get an idea of how bad the dolphin situation in the gulf really is when examined in the light of the historical data available. Unfortunately the graphs NOAA created comparing 2010 and 2011 to historic data made it hard to truly evaluate as they used the average value for each month for the historic data. Based solely on their presentation of the data all that can really be determined with any confidence is, as Para-sight noted:

…dolphin deaths are not that rare to start with and that this is the time of year for it (i.e. it’s not actually “unusual”, it’s “seasonally-appropriate”). Secondly, it means that it’s not much worse in magnitude than last year, albeit peaking a little earlier.

and

But in these cases, the devil IS so often in the details. I am not saying oil wasn’t involved; I’m just saying we can’t infer causality from correlation without a closer examination of the data.

Running the raw data through excel (to subtotal & clean) and R and I could see more of the information Para-sight warned was needed to get a better feel for the situation. And sure enough I was able to add to his observations that:

the pattern of the event does strongly suggest that the oil spill is related, but looking at the prior events, it is not outside the realm of possibility that this is not directly related to the oil spill. The best evidence to determine the involvement (or lack thereof) of the oil spill is going to be found in necropsy and toxicology reports of the stranded animals.

 

In the meantime however several people commenting on the original post pointed out that the biggest issue was with the a specific age class of dolphins. As “Dr. Ed Cake” commented:

…In the meantime, this year’s neonates continue to wash ashore dead in the BP spill zone and all potential mortality factors can be dismissed except the impacts of BP’s oil and Nalso’s Corexit dispersants.

and “Don” stated:

NOAA provides data specifically for “baby dolphins.” It shows a five-fold increase in mortalities in 2011 compared to 2010.

So that last bit especially really made me wonder. The more detailed data showed there was an unusual mortality event (UME) happening with cetaceans in general, though it is not clearly linked to the spill. But a five fold increase for one key age class??

 

So tonight I journeyed forth again to the NOAA website, and found on scanning down to the bottom of the page, there is a table of “baby dolphin” deaths as Don phrased it. It is more accurately “premature, stillborn, or neonatal dolphins”. NOAA used the cutoff of 115cm total length to split out these deaths and presented them as both a table and graph:

neonatal, stillborn and premature dolphin deaths

Graph of all neonatal, stillborn and premature bottlenose dolphins

As both commenters pointed out there seems to be a major issue going on here! Just look at February! a 36x increase from 2010 (pre spill) and an almost 18x increase in deaths over historic averages. And March is better, but still a 2x increase over last year and 3x increase over historic levels.

However, here again we are given historic numbers solely as the average. As Para-sight warned, and we saw with the complete cetacean data set, that is a dangerous way to look at data like this.

But what about the “baby dolphin” data?

From the truncated graph NOAA presented it appears there may be a seasonal pattern (not unexpected) but it is hard to tell with out the complete year graphed. It is also not possible (again) to tell what the range of values is for each month. The numbers 100 and 2 average out to the exact same value as 52 and 50 do, but the range of values, and the story it tells, is quite different! Back to the data set, with definition in hand I extracted all bottlenose dolphin records with lengths of 115cm or less and duplicated the analysis I did on the full set over at DSN. The resulting table:

Gulf Coast Strandings of Bottlenose Dolphin <115cm

First thing to notice comparing the two tables is that yes February deaths were an order of magnitude higher than the historic range.

The second thing to note is — they don’t match. Not just that I included the entire year in the table, but the numbers for the averages in each month they did include don’t match. They only averaged 2002-2007 for their historic baseline. So I redid the table using only 2002-2007, still the numbers were slightly off. Using their dataset and the definition for premature / neonatal / stillborn they gave, I get different numbers than they do. Aggravating. I am comparing historic data to the 2011 data, I double checked all the subtotals are correct and redid the summary table creation three times each time with the same result. I have verified that the data in the table is true to the definition given by NOAA and the raw data NOAA has provided. I really don’t like omitting data without good reason (which was not provided by NOAA) so I finished the evaluation keeping the 2008-2009 data in:

Box and Whisker plot of Stranded Bottlenose Dolphins less than 115cm. 2011 data blue diamonds, 2002-2010 data boxplots with outliers as open circles.

So what does this one tell us? And is it different from the conclusions inferred from the NOAA graph?

Based on the NOAA graph and table one would assume that all months except April were UME months, and April, one could easily argue, is on it’s way to being a UME month as well, after all, that is only 10 days of April data. However, looking at the box plots only January and February really stand out as a UME(s). February is clearly an outlier, in fact the most deaths and the most radical outlier of the entire data set. January is an outlier but is it a UME? I guess that depends on the definition, but it is two standard deviations above the mean January number of deaths, so I would say it is a UME. While March is high, it is still within the range of expected numbers from the historical data. Granted March is right at the upper edge of that range, but it is within that range nonetheless. If we were using Para-sight’s suggestion of using values greater than two standard deviation above the mean as the definition of a UME then only January and February would be as well. However when taken with the previous two months I would suggest that March is a continuation of the same UME.

As for April, with only 10 days of data for the analysis, it is right at the edge of the expected range of values from the historic April data. One could argue, I suppose, that it will climb much higher, but without the actual numbers any discussion of that is pure speculation. I expect that it will climb as well, but we’ll just have to wait for the rest of the data to come in to really say anything about it conclusively.

What is more interesting to me though than the month by month numbers is the pattern, both historic and for this year. Historically, just as with the larger population, bottlenose dolphins < 115cm have a clear pattern of strandings. The late winter and spring (January through May) is the period of strandings, with most strandings clustered in the February through April period with the sharp peak in March. The rest of the year have much lower numbers. The box plot and the full table also reveal that each year it would be unexpected to find no strandings under 115cm but in the summer through early winter (June through December) it is actually an unusual event to find a stranding, let alone more than one.

This is a direct clue to the reproductive biology pattern of the Bottlenose Dolphins in the Gulf of Mexico. I have very little knowledge of dolphins so had to turn to some local experts for help (contrary to popular belief all ocean scientists DO NOT work on dolphins). Turns out that Bottlenose Dolphins generally mate from late March to May and have ~11-12 month gestation period. At birth a bottlenose dolphin is anywhere from 85-110 cm long (the reason for the 115cm cutoff). Looking at the length records the smallest animal found was a January 2006 stranding at 40cm. Fully 83% of the strandings were of individuals between 85 and 110cm which combined with the seasonal pattern would corresponds essentially to very late term neonatal or newborn aged individuals. Examining the length vs. month of death did not provide any real patterns except that March had the widest range of sizes in the strandings, which is not too surprising as March had significantly more deaths than any other month in the aggregated data.

This year however the peak is clearly in February. Why? Unfortunately I did not see any length data for this year, just summary numbers. Were these individuals mostly less than 85cm? That would infer prematurely born babies or late term miscarriage. If they were mostly in the 85-110cm range? That would suggest they were from the early mating season, likely conceived before the oilspill, and at full term.

So, yes , there is a major unexplained mortality event that is causing a significantly higher number of neonatal, stillborn and premature Bottlenose Dolphin strandings. February’s mortality is the worst event for individuals under 115cm on record, by a fair margin. I expect the number for April will go higher, quite possibly being an outlier. From the timing, the oil spill is certainly a suspect for being a cause. That does not however mean it IS responsible. There is still no way to know the oilspill’s relation to these deaths without detailed necropsy and toxicity study results.

Some R + Google magic

Nothing much to see, just me making sure I understand everything going in in the post over at R-Ecology to use Google Visualization API. Using this we can get excellent presentation of data like Hans Rosling’s brilliant TED Talk. Took the fish harvest data set and added a new column for trophic level (using fishbase for species I didn’t already have literature references for). First the basic modification to make sure I understood the subsetting etc.

Be sure and play with the axis and selecting species etc.




Data: trophdata2, Chart ID: MotionChart_2011-01-23-00-26-21

R version 2.12.1 (2010-12-16),

Google Terms of Use

Generated on Sun Jan 23 00:26:25 2011 with googleVis.
Powered by R.rsp v0.4.1.

This would be sweet to use on the Vinal Edwards data! I know i still have it around here somwhere… or the NEBO data?

Next add subsort by price per pound and trophic levels then….[edit] done!

Eclectic Echoes is Stephen Fry proof thanks to caching by WP Super Cache