Schedule All

This page contains all the daily schedule pages from the whole semester.


01-Introduction

by samooresamoore (27 Sep 2008 15:01)

09/03/2008

Description of what the course is about, what we'll do in the class, why students should take it, why all their friends should take it.

My notes

  1. Introduce myself.
  2. Go through the class pitch
    • At the end of the pitch have a discussion about the merits of the class, what they're thinking, what they think sounds good, what sounds confusing.
    • My slides (slides at SlideShare)
  3. Take role; use the Photo Roster
  4. Course wiki
  5. Discuss interesting parts of class from my perspective:
    • practical
    • class wiki (that is open, evolves, contains forum & blog)
    • individualized learning
    • project-centric
    • your own wiki
    • revised from last year (projects, due dates, reading)
    • web-based almost completely
    • lots of small tasks; can't fall behind

To do

  1. Sign up as WikiDot member
  2. Apply for membership to the BIT330 web site.
  3. Read the following pages in this wiki: schedule, syllabus, assignments.
  4. Look through the rest of the course wiki.
    1. Choose every menu item in the wiki and see what it gets you. Note that I didn't say you have to read everything in the wiki — just know what's here so that you can find it at the appropriate time.
    2. You'll definitely want to note the discussion forum in case you have any questions.
  5. Sign up for class notes or questions (as described on the assignments page).
  6. Sign up for search industry updates (as described on this page).

02-Web search

by samooresamoore (13 Sep 2008 16:32)

09/08/2008

Discuss basics of Web search and why students should use multiple search tools (rather than just Google).

At beginning of class

  1. Take role.
  2. Convince someone to take notes and post them for today's class
  3. By now you should have done the following. This is not optional. This is not to do later. These sign-ups are due today.
    1. Become a WikiDot member
    2. Become a member of the the BIT330 web site
    3. Read through the major pages of the wiki and looked through the rest of it
    4. You should have signed up for class notes or questions (as described on the assignments page)
    5. You should have signed up for search industry updates (as described on this page).
  4. The structure of today's class is going to be fairly standard for the rest of the semester:
    • Start off with announcements and taking questions and comments.
    • Lecture for a bit. This lecture will be something of an overview and will provide the motivation and background for the exercises that you will complete and the assignments that you will have to work on.
    • Provide some time for you to start on your exercises (which will allow you to explore the specifics of the topic that my lecture introduces).

My notes

  1. Go through “At beginning of class”
  2. Make presentation with slides
  3. Go through the search syntax page
  4. Discuss the Search tool data assignment
  5. Talk about blogging
  6. Point out the exercises for today; they should start working on these as soon as I'm done talking.
  7. Point out the “To do after class” section on this page

To do after class

  1. Finish the Web search exercises.
  2. Read the Web search overview.
  3. Read Life before Google.
  4. Start working on the Search tool data assignment. It's due in one week.
  5. You should be very, very familiar with the search syntax page by the next class

Resources

  1. Life before Google
  2. Search syntax — for both Google and Yahoo.
  3. Today's slides

03 Wiki Instruction Day

by samooresamoore (13 Sep 2008 16:59)

09/10/2008

We'll go over techniques and tricks related to using this wiki, which will also be the host of your term project wiki.

At beginning of class

  1. See if there are any questions about the assignment that students got last class.
  2. Note that students should check the announcements on the Web site on Sunday evening, Tuesday evening, and Thursday evening. Shouldn't take more than a minute or two.
  3. Students have their next assignment today: Search tool overlap data. This is due a week from today. Explain the assignment.

My notes

  1. Today we're going to learn about wikidot, get an idea of how to use it, get more familiar with working with a wiki.
  2. Wikidot is the host of the course Web site, but it's also going to be the host of your term project Web site.
  3. You are a member of the course Web site, and you will be the administrator of your own term project site. This means that, while you have total and complete control over your own site, you also have the ability to edit and create pages (but not delete them) within this course Web site.
  4. I will expect that you will be a very good user of this wikidot site by the end of the semester. Maybe you won't be an expert, but you'll be able to make a wiki that is filled with properly formatted content and useful navigation.
  5. The exercises for today's class can be found here.

During and after today's class

  1. Complete all of today's exercises before next class.
  2. Think about what you might do for the term project.
  3. Next class you will turn in the first assignment.

Resources

  1. Videos and other resources about wikidot
  2. Today's exercises are here.

Questions posed during class

  1. What does the pound sign mean in the menu bar (top menu bar)?
    • It means that the menu item cannot be followed — generally it's a menu header or divider.
  2. How do you delete pages?
    • Use the "site options" button at the bottom of the page.
  3. How do you edit pages?
    • Use the "edit" button at the bottom of the page.
  4. How do you list the pages on the site?
    • Use the button on the side bar that says "List all pages".
  5. Can you have the top menu bar automatically update with the most recent blog entries (or whatever)?
    • Yes, and you should look at the code I use to generate the "Announcements" on the first page, and then you would have to put this code in your nav:top page.
  6. How do you put an image hosted on mFile on your page?
    • The instructions are discussed on this page.

04 Search Techniques And Strategies

by samooresamoore (16 Sep 2008 20:00)

09/15/2008

We go over several standard search techniques and strategies.

Class structure

  1. Go through “At beginning of class” info
  2. Lecture through the slides.
  3. Talk through the examples
  4. Go through “At end of lecture”

At beginning of class

  1. Today's office hours are cancelled; I have to take my wife to the hospital for a CT scan.
  2. Check who is doing class notes for today
  3. If you have questions, please post them to the class's discussion forum; I check it frequently — much more frequently than I check email on the weekends.
  4. Students should go over announcements made since the previous class
  5. Collect assignments due today
  6. Remind about assignment due in the next class
  7. Your first possible blog entry (on today's exercises) could be turned in next class (see the schedule for details on the timing of blog entries)
  8. Sentence added to the Search Tool Data Analysis assignment: Proper use of statistical tests certainly would strengthen your arguments.
  9. Industry updates:

My notes

  1. Special search syntax — This is the tool that you have at your disposal that allows you to target your searches on specific parts of documents. Since different text in different parts means different things and perform different functions, you can use these operators to raise the precision of your queries.
    • Full text search engines
      • Title — intitle:
      • Site — site:
      • Top-level domain — site:
      • URL contents — inurl:
      • Links — link:
  2. Unique words and phrases — The use of multiple unique words and phrases are a key both to reducing the number of documents that are retrieved and raising the precision of your queries. Further, using multiple words and phrases increases the chances of retrieving content-filled documents (that is, increasing the number of “meaty” documents).
    • They can be used to focus in on more specialized pages that would use those terms
    • Gather related words using summaries
    • Use search engines to find related words
      • Example at Ask.com (both “Narrow your search” and “Expand your search”)
      • Google
        • Google Suggest feature
        • “Related searches” at bottom of search results window
      • Yahoo
        • Yahoo Search Assist feature
        • “Also try” at top or bottom of search results window
      • Yahoo Directory (we'll cover this in a future class) can point in the right direction
    • Use means queries
  3. Query specificity
    • Narrow to more general: this is when you have a real good idea of what you're looking for.
    • More general to narrow: this is when you don't know what you're looking for.
  4. Alternative naming
    • People
      • Using different name forms can return different information
      • Sometimes you have to use other information to differentiate two identically named people
      • Also, search specifiers can help target the information (intitle, site type, include, exclude)
    • Places
      • Use addresses (streets, zips, area codes, phone numbers)
      • Use "official"

In-class examples

Special search syntax

  1. Tigers
  2. Tigers but not Detroit Tigers.
  3. Information from an organization
  4. Information from an organization or a government
  5. Information from a zoo

Unique words and phrases

  1. Bunch of birds example
  2. Use "means" and "definition" queries: Hydrocephalus
  3. Related words: Investment guidance
  4. Fun with quotes
  5. Lyrics

Query specificity

  1. Dog breed information
  2. Dog breed disease information

Alternative naming

People

  1. George Washington information
  2. Stephen Hawking (as a name example)
  3. Levi Strauss (since there are two/three of them)

Places

  1. Pizza places in Ann Arbor
  2. The Sears Tower (as a landmark)

At end of lecture

  1. Start working on today's exercises. The exercises are on this page. You should work on them for no more than another hour outside of class; we will have more time in the next class after the lecture to continue working on them before going on to that day's exercises.
  2. If you are late turning in today's assignment, you still should go through the effort of posting the information to the results page — the analysis assignment that you will be doing depends on having this information.
  3. If you are going to write a blog related to today's exercises, be sure to review the blogging guidelines before doing so.

05 More Search Techniques And Strategies

by samooresamoore (17 Sep 2008 18:44)

09/17/2008

We go through the exercises related to search techniques; we also discuss evaluating sources from the Web.

Class structure

  1. Go through “At beginning of class” information
  2. Lecture (but no slides today) going through “My notes”
  3. Work on exercises
    1. Finish the exercises from last class
    2. Then work through the exercises for this class

At beginning of class

  1. So, my wife has liver cancer. Either benign (hemangioma) or malignant. We don't know yet. We've voted for benign. She'll be having an MRI in the next couple of days (we hope). Just wanted you to know.
  2. You should know that I send email when I have something to say to a specific student between classes. For example, if I have a something to tell you about your assignment specifically or an upcoming industry update or whatever, I will send you an email. The email will have BIT330 at the beginning of the subject line. For this reason, you might want to check your email no less than every couple of days to see if you have an email from me. More than likely, you won't — but you might.
  3. If any of you wrote a blog entry, please turn it in now. Generally, if you have written a blog entry, just bring it up to the podium for me before class starts.
  4. Check who is doing class notes for today.
  5. Check who is doing the Industry updates for next class.
  6. Students should go over announcements made since the previous class.
  7. Content update:
    • We now have some information on the notes page and questions page. Even if people are not signed up for specific days on the class notes page, I would still recommend that you post notes and questions. The more good information that is on these pages, the more that I will be able to use this information on the tests, and the better that you will do on the test — as opposed to me coming up with some random, poorly-worded question that you have to guess on.
  8. Collect the assignment due today.
  9. I have significantly updated the write-up of the next assignment. It is due on September 29. Do not start working on this yet. The data still needs to be cleaned up a little bit. I will make an announcement on the wiki when the data is ready. Further, I will post an Excel spreadsheet containing the data.
  10. You should be thinking about the topic for your term project.
    • Be sure to read the description of the assignment.
    • Also look at the list of industry sectors that you might select.
    • On day 8, which is September 29, you are turning in the first status report for your term project. On that date, you need to have decided on the topic, you need to have discussed your topic with me, described the topic on the start page of your wiki, and updated your information (that is, indicated the title of your wiki) on the class wiki's list of student wikis.
  11. Another note for your term project. Your term project reports will include a section on information sources (as we will discuss today). Part of this will be an evaluation of the quality of the information sources that you identified. You will want to describe how you evaluate the sources, and indicate on the report your evaluation of each one of them. This will not be a separate deliverable but should be integrated into the final report.
  12. There are so many blog opportunities from these two classes (i.e., today and Monday). If you want to blog on both classes, you don't have to choose something from “last class” and then something from “this class”. These are both the same topic; you can use any two things you want to blog on from both classes. It doesn't matter if they were the same or different days. (Again, you don't have to blog today, or last class. But you'll have to blog sometime, and you might as well start sooner rather than later.)
  13. Questions about any or all of this?
  14. Industry updates
  15. Demonstrate proper formatting of a wiki document
    • Also point out proper naming of a document

My notes

Discussion

  1. Long term research projects, or more difficult queries, require another level of effort and analysis.
    • Gather and save as much information as you can.
      • Use information from the search results, page characteristics, and contents of the results pages.
        • Look for names, contents, concepts, URLs, page titles, unique words, dates, places, facts, etc.
      • Create a wiki site to keep information and links.
    • Sometimes finding a set of related nouns and unique names can help you find what you need.
      • Use Google Sets
      • Use the queries ["type of X"], ["there are * types of X"], ["compared to X"], ["X vs." OR "X versus"]
  2. Evaluate the potential validity of the Web page from which you get information.
    • Facets to evaluation
      • Location of the page
      • Speaker's identity
      • Speaker's motivation
      • Credibility of sources
      • Speaker's history
      • Speaker's reputation

In-class examples

Candy bars

Types of things

Resources


06 Web Directories

by samooresamoore (28 Sep 2008 02:29)

09/22/2008

While focusing on Web directories, we learn about several different ways of using the work of others to help find information on the Web.

Class structure

  1. Go through “At beginning of class” information
    • You should read the “On your own” stuff even though I'm not going to discuss them
  2. Lecture. This will involve
    • Constructing the diagram
    • Constructing the flow diagrams for
      • Mailing lists and groups
      • Web directories
  3. Work on exercises

At beginning of class

On your own

  1. Before this class you should have already finished the exercises from last class (and all previous classes)
  2. If any of you wrote a blog entry, please turn it in now. Generally, if you have written a blog entry, just bring it up to the podium for me before class starts.
  3. Check who is doing class notes for today.
  4. Check who is doing the Industry updates for next class.
  5. You should be thinking about the topic for your term project. I have only met with one student about his topic so far.
    • Be sure to read the description of the assignment.
    • Also look at the list of industry sectors that you might select.
    • On day 8, which is September 29, you are turning in the first status report for your term project. On that date, you need to have decided on the topic, you need to have discussed your topic with me, described the topic on the start page of your wiki, and updated your information (that is, indicated the title of your wiki) on the class wiki's list of student wikis.

I'll go over with them

  1. Go over announcements made since the previous class.
  2. Return the assignment.
  3. Return the blogs from last class.
  4. You can start working on the next assignment. It is due on September 29.
  5. Industry updates

My notes: Using the work of others

  1. Web directories
    1. Defining characteristics
      • Defined categories of topics (ex: Yahoo Directory's categories)
      • Categorized sites
      • Browse or search
    2. Searching differs from full text search: should think about query terms as defining a class of results, not a specific result
    3. Role it fulfills
      • Allows an organization to gather, collect, and organize Web resources in an organized hierarchy. Keeps a user from having to search through the whole Web to find the best resources.
    4. Examples
  2. Groups & Mailing lists
    1. Defining characteristics
      • Messages between users sent over a period of time
      • Email might be involved (or it might not!)
      • Descriptions are searchable
        • But messages are only searchable sometimes
      • Searching descriptions is like searching a web Directory
        • Searching messages is a very strange, inexact process
    2. Role it fulfills
      • Very up-to-date, highly personalized information in the messages
        • As searchable information, it's not so great, but as a resource for yourself, it's invaluable
    3. How to locate: Either through group directories, mailing list services, web directories, or full text search engines
    4. Examples
  3. Glossaries, dictionaries, & encyclopedias
    1. Defining characteristics
      • Definitions of a set of related terms
      • A company, an organization or a person might assemble these
    2. Role it fulfills
      • Helps you get familiar with a topic area more quickly than otherwise
    3. How to locate
      • Use specialized search process in a full text search engine
      • There are also a few collections of these documents
    4. Examples
  4. Assembled pages of links
    1. Defining characteristics
      • An evolving set of links that a person or organization has gathered on a specific topic
    2. Role it fulfills
      • Allows an organization or person to share its expertise
    3. How to locate
      • Use specialized search process in a full text search engine
    4. Examples

In-class examples

Full-text versus Web directory

Hemangioma

Stephen Hawking

Investment guidance

Groups & mailing lists: Behavioral finance

  1. Yahoo Groups
  2. Google Groups
  3. Behavioral finance at The Mail Archive — 35 messages

Directories

Behavioral finance

  1. behavioral finance at Yahoo Directory — 84 categories, plus 196 individual URLs
  2. Behavioral finance at DMOZ — 3 categories (not good matches) and 4 sites
  3. No results at Mahalo
  4. No results at Alltop

Market research in the consumer packaged goods industry

  1. Yahoo Directory
  2. dmoz
  3. Mahalo — no results
  4. Alltop — no results

Glossaries

Look at The Glossarist, eGlossary, and Frank Dietz's Glossary Listing.

Links, directories, resources, and what's new: Investment guidance

  1. investment guidance links — 349,000 documents (1.77 million in 2007)
  2. 'investment guidance' links — 15,300 documents (18,000 in 2007)
  3. intitle:'investment guidance' links — 155 documents (125 in 2007)
  4. intitle:'investment guidance' directory — 11 documents (10 in 2007)
  5. intitle:'investment guidance' resources — 90 documents (86 in 2007)
  6. intitle:"investment guidance" "what's new" — 3 documents (5 in 2007)
  7. intitle:'investment guidance' (links OR directory OR resources OR 'what's new') — 146 documents (143 in 2007)
  8. intitle:"investment guidance" (intitle:links OR intitle:directory OR intitle:resources OR intitle:"what's new") — 9 documents

Resources

  1. Directories
    1. Yahoo Directory
    2. Open Directory Project
    3. Mahalo
    4. Alltop
    5. Best of the Web (been around since 1994!)
    6. The WWW Virtual Library
    7. Starting Point
  2. Groups
    1. Yahoo Groups
    2. Google Groups — more than 1 billion Usenet messages over 25 years
      1. Usenet timeline
      2. What is Usenet? (Part 1, Part 2)
      3. Google Groups Help Center
  3. Glossaries and dictionaries
    1. The Glossarist — dictionary of glossaries.
    2. eGlossary — can't find out when this was updated.
    3. Frank Dietz's Glossary Listing — strange, but it's good
    4. Information.Org's listings for Glossaries
    5. Google Directory Listing of Dictionaries
    6. Yahoo Directory Listing of Dictionaries
    7. Law.com Legal Dictionary
    8. TechWeb Encyclopedia
    9. Webopedia — computer & Internet technology definitions
  4. Email lists
    1. Mail Archive — archive of 9000 (4840 in 2007) mailing lists, containing 57.2 million (48.5 in 2007) messages.
    2. Tile.net — the reference guide to email newsletters & discussion lists
    3. Topica appears to have changed their focus. They certainly don't advertise that they have than email discussion list directory — - but they do. You can search either the list description or the messages themselves.

07 RSS Intro

by samooresamoore (27 Oct 2008 23:17)

09/24/2008

We are going to introduce the topic of RSS feeds.

Class structure

  1. Go through “At beginning of class” information
  2. Go through slides for RSS Intro lecture.
  3. Demonstrate bloglines.
  4. Work on exercises.

At beginning of class

On your own

  1. You should go to the SiteMaker page for this class.
    • Enter your personal information under "Student".
    • If you have completed a wiki-based assignment (blog entry, industry update, notes, or questions), then enter that information under "Add Wiki Assignment".
    • Do this now.
  2. Be sure to read the announcement about extended office hours that I'm holding to discuss your term project topics.
  3. Go over any other announcements you might have missed since last class.
  4. Check who is doing class notes for today.
  5. Check who is doing industry updates for next class.
  6. I will return blogs next class. Why? Two reasons: 1) I want to have more turned in before I assign these grades. 2) I want you to have entered this information into SiteMaker so that I can test the grade recording process.

I'll go over

  1. What this class should be like so far
  2. Any questions about this class so far this semester? Where we're going? Anything at all?
  3. Next class you'll be turning in the search tool data analysis assignment.
  4. Next class your first status report for the term project is due. You will not be turning anything in on this date. However, you will need to update your SiteMaker page and the list of student wikis.
  5. Industry updates: No one signed up for one, so I'll do it.
    1. A first look at Google's new phone
    2. Google does to books what it did with YouTube
    3. Will Google renew the browser wars of the 1990s?
    4. Google's 10th birthday site
    5. What will America decide?

My notes

The Internet is changing all of the time. New resources are being added at a phenomenal pace in millions of different sites. You can't keep up with everything on your own. You need help.

It's all about getting computers to work for you, to work while you're not using it. Use the computer to search through information so you don't have to. Use the computer to deliver information to your email inbox or to a specific Web page so you don't have to go get it. You don't have to remember to do the query.

You still have to define the search. You probably have to spend more time up-front when defining the query.

  1. RSS
    1. What it stands for
      • Really Simple Syndication
      • Rich Site Summary
      • RDF Site Summary
    2. RSS is an application of XML.
    3. RSS is an open definition so anyone can use it.
    4. RSS is a standard widely adopted by millions of Web sites
    5. If you have a Web site that is updated relatively frequently, it makes sense to put these updates into an RSS feed.
  2. Compare HTML and XML
  3. For our purposes, what are the benefits of XML (and, hence, RSS)
    1. Can easily be translated into HTML for display purposes
    2. Can specify "fields" that can be searched
  4. So, what does this mean for RSS?
    1. RSS a common representation for lots of databases and lots of Web sites
    2. This common representation means lots of tools can be specially written to work with that standard (send it, search it, slice it, dice it)
  5. So, what does this mean for you?
    1. Saved time
    2. Saved attention
  6. Classes of RSS feeds
    1. Blogs
    2. Newspaper articles
  7. Online feed readers
    • Why not feed reader application?
  8. Where can you find RSS feeds
    1. In RSS feed directories (with search)
    2. In searchable subject indices of RSS feeds (with browsing)
    3. On RSS-enabled Web pages
    4. Created keyword-based feeds at search engines
  9. Types of RSS feeds
    1. Static feeds
    2. Keyword-based feeds

Resources

Online feed readers

Where can you find RSS feeds

  1. In RSS feed database (with search)
  2. In searchable subject indices of RSS feeds (with browsing)
  3. On RSS-enabled Web pages
  4. Created keyword-based feeds at search engines

RSS feeds from Wikidot

  1. RSS feeds for a separate page
  2. Be notified: RSS feed guide

08 RSS Lab

by samooresamoore (29 Sep 2008 15:01)

09/29/2008

We are going to work through more exercises allowing you to explore RSS feeds and related tools.

Class structure

  1. Go through “At beginning of class” information
  2. No lecture today.
  3. Work on exercises.

At beginning of class

On your own

  1. Check who is doing class notes for today — and change your sign-up date. We're not going to have any lecture today so there's nothing for you to write about. Sorry!
  2. Bring your print out of the search tool data analysis assignment to me at the podium.
    • I do not yet have a key to this assignment. I don't know what the answer is. I hope to discuss this next class.
  3. Go to your grades on the SiteMaker page.
    • I have added a menu choice “Basics/Grades” that allows you to access your grades from anywhere within this wiki.
    • From the SiteMaker grade page, do the following:
      • Choose “Add Grade Record” and then input (and save) your uniqname.
        • Within a couple of days I will input your assignment grades (from the data gathering assignments) here so that you can see them.
      • Add your analysis assignment under the “Add Wiki Assignment” menu choice.
    • I have not yet graded industry reports, notes, or questions. I will have these graded by next class.
  4. You can check to see if I have recorded any grades for you on the SiteMaker page.
    • All blog entries, industry updates, notes, and questions are points out of 10.
    • I have several printed blogs for which you have not entered information yet.
    • A “9” grade on a blog is what I would call a “normal, high-quality, well-written, informative blog entry.” A “10” means that you exceeded this standard. Your entry was somehow more informative, more insightful, more engaging (don't discount this — I very much welcome reading an interesting well-written entry with a good story integrated into it) than my expectations.
    • If you get a “10” on a blog entry, I want you to copy your blog entry from your wiki to my wiki. Create a page with the same name (i.e., “blog:XXX”) but it should be in the class blog. Do this as soon as you see your grade. Thanks.
  5. I thoroughly enjoyed meeting with most of you last week and discussing your term projects. I'll be checking your wiki home page in the version as of the beginning of class.
  6. Go over any other announcements you might have missed since last class.
  7. Check who is doing industry updates for next class.
  8. Check the start page for any blogs you might be interested in that you might have missed.
  9. From now on you don't have to “turn in” blogs with a piece of paper. You simply need to record the page within the SiteMaker page. I will be able to see that the page doesn't have a grade and will grade it (with a goal of the next class for the rest of the semester).

I'll go over

  1. A little feedback about this class compared to last year
  2. Feedback about blogs
    • it's versus its
    • inurl versus site
    • good examples posted
  3. Upcoming assignments
  4. Industry updates

09 News Search

by samooresamoore (14 Oct 2008 02:19)

10/01/2008

We learn about the major news search tools, as well as how to integrate them with your knowledge of RSS.

Class structure

  1. Go through “At beginning of class” information
  2. I'll lecture for a bit (using some slides).
  3. Work on exercises.

At beginning of class

On your own

  1. Be sure to have all of your information entered under “Basic/Grades”. Otherwise, I can't record your grades!
    • Do not leave class today if your information is not entered into the SiteMaker site.
  2. Personal
    • My wife's MRI came back with less-than-encouraging news.
    • She now has a biopsy scheduled for next Monday during class. I should be with her.
    • So we're not having class next Monday.
  3. Check who is doing class notes for today. I had to change this a bit for this class and next Monday because of the previous item.
  4. Assignments
    • Because of the turmoil of yesterday in my family, I didn't get much grading done.
    • I hope to have more graded soon.
    • I hope to discuss the search tool data analysis assignment next class.
  5. Go over any other announcements you might have missed since last class.
  6. Check the start page for any blogs you might be interested in that you might have missed.
  7. From now on you don't have to “turn in” blogs with a piece of paper. You simply need to record the page within the SiteMaker page. I will be able to see that the page doesn't have a grade and will grade it (with a goal of the next class for the rest of the semester).

I'll go over

  1. Let's look over the upcoming industry updates.
  2. A little feedback about this class compared to last year
  3. Industry updates
    • None today

Resources

News search

Print newspapers


11 The Deep Web

by samooresamoore (14 Oct 2008 02:18)

10/08/2008

We learn what the deep Web is and why we need to care about it.

Class structure

  1. Go through “At beginning of class” information
  2. I'll lecture for a bit (using some slides).
  3. Work on exercises.

At beginning of class

  1. My wife's biopsy went well. We'll find out the results by no later than Monday (or so we're told).
    • Thank you for your kind words of support.
  2. Check your wiki grades. If you got a 10, please be sure to add the blog to the class wiki.
  3. Be sure to have all of your information entered under “Basic/Grades”. Otherwise, I can't record your grades!
    • Don't forget — this is how you “turn in” your assignments.
  4. Blog entries
    • to/too
    • choose/chose (lose/loose)
    • definitely/definately/defiantly
  5. Check who is doing class notes for today.
  6. I'm working on grading and recording. I'm about half way through grading your analysis assignments.
  7. This is my write-up of the analysis assignment.
  8. Industry updates

My notes

  1. General Web search
    • Suggests that all information can be searched within one system
    • Easy and self-explanatory
    • Has only a limited understanding of "structure"
  2. The Invisible Web
    • "Invisible" to the general search engines since they don't index it
    • You'll hear about the "Invisible Web" or the "Deep Web" — same thing
    • Pages that are invisible
      • Disconnected page
      • Page consisting primarily of images, audio, video
      • Flash, Shockwave, compressed files
      • Content retrieved as a result of filling out forms
      • Real time information (ex: stock quotes)
      • Pages that are proprietary
    • Significance of the Invisible Web
      • Bergman's widely-cited statistic is that there are 550 billion documents in the invisible Web
        • Others believe it's more like 20-100 billion
      • Estimated that there's about 300K Web sites with queryable databases
  3. Academic Web-based search
    • More academic content is moving to the Web exclusively
    • Part of general trend from print to electronic
    • Much of this is contained in the Invisible Web
  4. Explain how search engines work
    • General
      • Crawlers go out and send information back to the central database
      • Queries go against the central database
      • SE company expertise is in design of the index and design of the query process (including input interface and output formatting and reporting)
    • Academic
      • Crawlers go out, find a database, and what? Index the query interface page? Send some standard queries to the index and save the results?
  5. Should you consider using Google Scholar?
    • Pros
      • A cross-database (federated) search engine
      • Returns snippets from articles (and sometimes abstracts)
      • Indexes the full text (actually, part of the full text) and not just the abstracts and subject terms
      • Can link to your own school's library
    • Cons
      • Secretive about its coverage of specific publishers, journals
      • Limits it searches to the first 100-120K of a page
      • Hasn't been updated much (at all?) since its launch
      • Returns far fewer documents than the native search engines
      • Searching by field is fairly unreliable and counter-productive
  6. What do we want from an academic search engine?
    • Comprehensive
      • Contains lots of journals over lots of topics
      • Goes far back in time
      • Up-to-date
    • Integrated across databases
    • Integrated into a database
    • Transparent as to what it contains or doesn't contain
  7. Recommendation
    • Use Google Scholar
      • as a way to find free, online versions of articles you already know you want
      • like you use Wikipedia — as a good starting place for exploring
    • Use the other Deep Web search tools — Scirus, Turbo10, plus the LII.
    • To do a complete search, you should definitely talk to a librarian and use the Library's immense set of resources.

In-class exercises

  1. Google Scholar
    1. Basics
      1. intitle:"carbon trading" — 271 citations
        • Cited by
        • Referenced by (under “Related articles”)
        • Web search
        • Availability at UM library
    2. Weird logic
      1. the — 2.03 billion records
      2. a — 13.1 million records
      3. a OR the — 13.6 million records
    3. Subject groups
      1. intitle:Vietnamese — 9,690 records
      2. allintitle:Vietnam — 816,000 records (all subject areas)
        • Shockingly, as you will soon see, intitle:Vietnam returns the exact same results (which it should).
      3. allintitle:Vietnam — 29,100 records (with all of the subject areas checked)
      4. allintitle: Vietnam OR Vietnamese — 104,000 records (notice that this is less than the 816,000 found for Vietnam alone above)
      5. allintitle: Vietnam OR Vietnamese — 141,000 records (with all of the subject areas checked)
      6. Publication year strangeness
        1. allintitle: Vietnam OR Vietnamese 1435-2008 — 151,000 records
        2. allintitle: Vietnam OR Vietnamese 1960-2008 — 152,000 records
        3. allintitle: Vietnam OR Vietnamese 2010-2050 — 12 records
  2. Scirus
    1. title:carbon AND title:trading (market) — 560 hits
      • Ability to filter on the left (sources, file types)
      • Recommendations of refining your search on the left
      • Save or email the results.
      • Sort by relevance or date.
      • Similar results
  3. Turbo10
    1. Search for [carbon trading] at Turbo10.
      • Topic clusters
      • Engines
  4. BNet
    1. carbon trading
      • Content types to right
      • RSS feeds

Possible blog entries

There are two possible blog entries related to this class — you can write one, both or neither of these. But I would find these interesting.

  1. Write a blog entry on what you observed, what you learned and found interesting, focusing on information that other students might find useful.
  2. Go talk to a Ross librarian. Tell them your topic and ask what 3 to 5 databases or tools that you might find most useful given that topic. See what databases they might tell you to focus on. Use them for a while. By the end of the semester, write a blog entry describing how the information you find in these databases differs from what you would find in the Web at large or what you found in the Deep Web search tools we were introduced to above.

BTW, I would find it rather remarkable if you didn't have in your term project a section or group of resources or something related to information a person could get in a library's database (compared with Deep Web and the Web itself).

Resources

Search tools

  1. Scirus
  2. Google Scholar
  3. CompletePlanet — 70K databases (but appears to be dead!)
  4. Amazon Advanced Book Search — Yes, I am including Amazon, the book seller, on this list.
  5. University of Michigan Library
  6. InfoMine (UCal, Riverside)
  7. Librarian's Internet Index
    • Overview: describes who they are, what they do, and what you might expect to get from looking at their site.
  8. Directory of Open Access Journals — 1262 journals are searchable at the article level, 211,294 articles.
  9. Turbo10
  10. Directory of Open Access Journals
  11. Microsoft Live Search Academic — closed down in May 2008.

Articles

  1. Accessing the Deep Web
  2. Exploring the academic invisible Web
  3. Google Scholar revisited by Peter Jascso, Online Information Review, 32:1, 2008, pp. 102—114.
  4. The Deep Web: Surfacing hidden value
    • As summarized by the editor of The Journal of Electronic Publishing: "Michael K. Bergman, whose BrightPlanet company offers a new approach to search engines, examines the wealth of information that is available only on dynamically created Web sites, those that don't exist except as relational databases until someone seeks information from them. As more sites adopt the dynamic approach to pages, they are creating a challenge for standard search engines. This article looks at some alternatives."
  5. Search engine technology and digital libraries: Libraries need to discover the academic internet
  6. Google Scholar -- a new data source for citation analysis, by Anne-Wil Harzing, February 5, 2008 (7th version).

E-books

  1. Google Book Search
  2. Project Gutenberg
  3. American Memory (by the U.S. Library of Congress)
  4. Million Book Project
  5. Google Electronic Text Archives

Other


12 Project Day

by samooresamoore (13 Oct 2008 15:29)

10/12/2008

We are going to discuss the term project.

At beginning of class

  1. I will pass the analysis write-up assignments back.
  2. You should check your grade records. I have entered your grades for almost all of the blogs, for the first data assignment, and for the analysis assignment.
  3. The following students do not have grade records. Show them what to do:
    • Roopak
    • Hernando
    • Bryan
  4. Short notes about your writing:
    • separated/seperated
    • cite/site
    • your/you're
    • digits of precision
    • [query delimiters]
  5. Notes about search write-up
    • relevancy
    • meta-search engine
  6. Additional research topics (for search engine experiment)
    • Relevance of top 5/10/20
    • Overlap of relevant in top 5/10/20

Analysis assignment information

Descriptive statistics

  • Average: 83.6
  • Median: 84
  • Standard deviation: 11.2

Distribution of grades

5 |0
6 |
7 |68
8 |003335577
9 |026
10|2

Grading methodology

Questions & queries
you got 10 points if you showed both your questions and queries
Data that I collected
you got 10 points if you showed all of the data
Results
There are 40 possible points here.
  • No statistics: 5 pts/ up to 10 points if discussion is good.
  • Mean information: 15 pts/ up to 20 points if discussion is good.
  • Spread information (i.e., s.d.): 25 pts/ up to 30 points if discussion is good.
  • hypothesis testing: 35 pts/ up to 40 points if discussion is good.
Discussion
There are 40 possible points here. I looked for responses related to meaning, recommendations, learning, and further questions. I read everyone's responses, and grouped like responses together while ranking them. I then assigned points from 15 to 37 depending on the relative quality and completeness of your responses.

13 Email Alerts

by samooresamoore (07 Nov 2008 23:53)

10/15/2008

We are going to discuss the different types of email alerts, how to find them, and how to generate them yourself.

Class structure

  1. Go through “At beginning of class” information
  2. Go through diagram explaining email alerts
  3. Work on exercises

At beginning of class

  1. I switched the order of “Email alerts” and “Page monitors”. Those who were supposed to give industry updates on the PM day should still give them today as scheduled.
  2. I haven't graded anything new.
    • I'm giving a seminar on Thursday (tomorrow) about the technology that I use to teach this class (wiki, videos, presentations).
  3. Check the class notes page to see who is taking notes today and who is coming up with questions.
  4. Go over any other announcements you might have missed since last class.
  5. Check the start page for any blogs you might be interested in that you might have missed.
  6. From now on you don't have to “turn in” blogs with a piece of paper. You simply need to record the page within the SiteMaker page. I will be able to see that the page doesn't have a grade and will grade it (with a goal of the next class for the rest of the semester). Be especially certain to do this before Fall Break.
  7. Let's look over the upcoming industry updates.
    • For this class: None
    • For next class:
      • Search industry update: Andrew Hermatz (HermatzHermatz)
      • Google update: Dylan Burkhardt (dylanbdylanb)

My notes

The following is the drawing that I created in class describing the relationship among all of these change notification technologies.

changenotification.jpg

Monitoring changes

  1. Email alert service
    • Monitor entire site
    • No false positives
    • Sometimes you want email (cell phone! or even Messenger)
  2. Next class
    1. RSS feeds
      • False positives
        • Unless keyword based or filtered somehow
    2. Page monitors
      • Lots of false positives unless keyword based
      • Monitor specific pages (but not sites)
  3. Why not just use RSS
    • Some sites don't have RSS feeds
      • So use site-based email alerts
    • Some information isn't site based
      • So use search-based email alerts
    • Some information is too fine-grained to be covered by RSS feeds
      • So use page monitors (next class)

Finding email alerts

  1. Search for email alerts
    • Query: "email alerts" OR "e-mail alerts" OR "email alert" OR "e-mail alert"
    • Google results (77.2 million in 2008) (60.2 million in 2007)
    • Yahoo results (392 million in 2008)
  2. More specific search for email alerts
    • Query: inurl:mail OR inurl:alert "email alerts" OR "e-mail alerts" OR "email alert" OR "e-mail alert"
    • Google results (115,000 in October 2008)
    • Yahoo results (242,000 in 2008)
  3. Science email alerts
  4. Copper email alerts

General email alert services

  1. Yahoo Alerts
  2. Google Alerts
    • All of this is based on submitting queries
      • Once a day
      • Once a week
    • Broad-ranging alerts
      • Web & comprehensive alerts
    • More specific
      • Keyword-based alerts for news, blogs, video and groups
  3. GoogleAlert
    • Advanced search form is the thing that makes it useful compared to Google's alert service

Email filtering

  1. Gmail
    • Limit around 7.2GB (4.5GB in October 2007)
    • Can use a filter
      • To forward just some emails (to different people?)
      • To apply a label to emails
    • Plus addressing
      • A powerful method that can be applied to Email alerts is using “plus addressing” service when you sign up for an Email alert (e.g. from some query), i.e. tell them that your address is dummy+moc.liamg|reifitnedIyreuQemos#moc.liamg|reifitnedIyreuQemos instead of the normal address moc.liamg|ymmud#moc.liamg|ymmud. Thus, if you get this address to your mail account, you can filter it by what comes after the plus! This is a extremely helpful since it makes it easier to filter emails.
      • Description for GMail and description for Yahoo Mail
      • Use a different address for each email alert
      • Helps you filter
      • Helps you track who is selling your email address
      • Can be used with GoogleAlert but not Google Alerts (the Google-affiliated site)
  2. Defining a filter
    • Keep definition to a minimum, as simple as possible
    • Test, test, test

Project wiki clarification

You should do the following for your project wiki:

  1. You should figure out some way that you are going to document the email alerts that you use in your email account to route your incoming alerts. Maybe print the alert page to a PDF file and link it to your wiki? Maybe take a screenshot of your email inbox and highlight the email alerts?
  2. In either case, you are going to want to have a section in your wiki called "Email alerts".
  3. On this page you should describe each of the email alerts that you used: the page from which you subscribed to it, why it is useful, and if there are any keywords (or such) that you used to generate it.

14 Page Monitors

by samooresamoore (02 Dec 2008 01:16)

10/22/2008

We discuss page monitors and then go into some detail about advanced RSS feed-manipulation Web sites.

Class structure

  1. Go through “At beginning of class” information
  2. Go through diagram explaining page monitors & RSS filters
  3. Work on exercises

At beginning of class

  1. I haven't graded anything new.
  2. Check the class notes page to see who is taking notes today and who is coming up with questions.
  3. Go over any other announcements you might have missed since last class.
  4. Check the start page for any blogs you might be interested in that you might have missed.
  5. I have arranged the Yahoo visit for early December. Don't miss that class!
  6. I want to review what this class is about.
  7. Let's look over the upcoming industry updates.
    • For this class:
    • For next class:
      • Search industry update: Max Rossiter

My notes

notificationsAndFiltering.jpg

Page monitoring software

Overview

Page Monitors were the next big thing five years ago. It is a program or web based program that you download. Each day (or whatever time period you want to set) it downloads the webpage, and if it's different it will send you an email. Some tell you what has changed while others just tell you that it has changed.

At first, you might not be that impressed with page monitors. But after realizing that it can be used for a lot more than news, it can be quite a useful tool. WatchThatPage.com is the best free site.

WatchThatPage has a limit of 250 characters for the URL. Also, shortened URLs (from tinyurl.com) do not work. To get around these problems, use TrackEngine, where neither of these problems exist.

Feed creation software

Overview

Make a feed

From a page
  • Dapper
  • FeedYes
  • Feed43
    • Feed43 is a little bit more complicated. You have to find the actual html within the source code of the page.
      • Define Extraction Rules – By finding the specific places (within the code) of the information that you’re looking to have monitored by the RSS feed. There are directions for what specific code to use in the program.
      • Then click extract
      • Then you can give it a title, description, url, etc
      • Then put in where the title, date, etc are etc
    • If these sites are updated once a month, its too much of a hassle to make one of these (use a page monitor). But if it is updated daily and you want to monitor it, then it might be a good idea to make one!
    • Free, or $29/year for 20 hourly updates
    • My feeds
From other feeds

Purpose of today's tools

We’ve already done some Email alerts. But the remaining question is: why would we do all these different things that are available?

  1. Focused RSS feed — If you’re lucky, there is a keyword-based, or specific-topic defined, RSS feed available for a site you can subscribe to.
    • Specific sites (findable in all the usual ways)
    • Dapps at Dapper.net
    • Pipes at Yahoo Pipes
  2. General RSS feed: If there's simply a general RSS feed (such as "Yahoo breaking news"), then you should run that feed through a keyword tool:
  3. The following are useful if there's no RSS feed available on a page:
    • FeedYes: I would try this first since it's the easiest to use when setting up a feed.
    • Feed43: This is more powerful but more difficult to use.
    • Dapper: This is another powerful tool. One of the benefits of this tool is that you can use pre-defined feeds ("dapps").
  4. Page Monitor — Use Page Monitor to get any updates on a specific page sent to your Email account.
  5. Email alerts — This was a precursor to RSS feeds. Some sites will give you updates to their site via email, not RSS.

Hints about possible test questions

You're definitely going to be held responsible for the following topics:

  1. What WatchThatPage (as an example of a page monitor) can do
  2. What Dapper can do
  3. What Feed43 can do and how its search patterns work
  4. What Yahoo Pipes can do and how feeds can be manipulated (for example, Fetch Feeds, Union, Filter, Sort)
  5. Under what circumstances would you use each one of these tools (as opposed to another)

I'll add to this later but this should give you an idea of the type of questions that I might ask.

Possible blog topics

You do not have to write a blog. These are suggested blog topics if you were to write one. There are lots of possibilities in this class. Describe different ways that you found these tools useful. Describe how you used Yahoo Pipes, possibly differently than how we have described them here.

Resources

Page monitors

Web-based

  • WatchThatPage
    • Free (for any number of pages), or $20/year for priority service
    • Can highlight changes in pages
    • Changes sent in an email
    • Keyword matching
    • This site doesn't appear to be updated any more (3+ years)
  • TrackEngine
    • Free for 5 bookmarks, or $20/year for 10 pages, or $53/year for 50 pages
    • Highlights new content in HTML email
    • Monitors changes daily
    • Does do keyword matching
    • This site hasn't been worked on for 6+ years
  • Other possible sites: InfoMinder, ChangeDetect, Trackle

Mac software

Windows software

Examples


15 Tag Based Sites

by samooresamoore (27 Oct 2008 15:39)

10/27/2008

We discuss social news and bookmarking sites.

Class structure

  1. Go through “At beginning of class” information
  2. Go through diagram explaining page monitors & RSS filters
  3. Work on exercises

At beginning of class

  1. I haven't graded anything new.
  2. Check the class notes page to see who is taking notes today and who is coming up with questions.
  3. Go over any other announcements you might have missed since last class.
  4. Check the start page for any blogs you might be interested in that you might have missed.
  5. Again: I have arranged the Yahoo visit for early December. Don't miss that class!
  6. Let's look over the upcoming industry updates.

My notes

Introductory information

  1. Describing with tags
    • Taxonomies
    • Folksonomies
  2. Types of social sites
    • Social News (based on tagging)
      • Technology
      • Search & Internet marketing
      • For researchers & scientists
    • Social Bookmarking (based on voting)
  3. Features & dimensions
    • Voting
    • Current (e.g., “what's hot”)
    • Time periods (e.g., “last hour”, “today”, “last week”, “last month”)
    • Topic categories (e.g., “business”, “entertainment”)
    • Tags (both personal and shared)
    • Web storage & search
  4. Discuss the sites

General market size information

diggreddit.png

The following series of charts shows the relative traffic for several social news and/or bookmarking sites. The first chart shows that digg has about 7x the traffic of reddit. I am using these two sites as benchmarks for the two charts after that. (Each chart can only compare five sites.) In the first chart, digg is the benchmark, while in the second chart reddit is the benchmark. Again, remember that digg has about 7x the traffic of reddit.

socialnews1.png

The second chart shows the relative traffic of digg, stumbleupon, slashdot, propeller.com and fark. Here you can see that digg is approximately 3x the size of the next busiest sites, stumbleupon and slashdot. Propeller and fark are no more than 40% of the size of those smaller sites.

socialnews2.png

The third chart shows reddit as the clear size leader, though mixx (which only came onto the scene in any reasonable sense in May 2008) is growing quickly and relatively steadily. Newsvine is showing some signs of growth as well. Both dzone and sphinn have maintained a relatively steady amount of traffic for the last year.

bookmarkingsites.png

The fourth chart shows traffic comparisons for sites that are more focused on social bookmarking than news. There has been a clear growth in traffic over the last year with CiteULike and Diigo leading the way. Over the same period of time simpy has lost traffic, ma.gnolia has maintained, and Connotea has grown significantly.

diggcomparison.png

The fifth chart shows the relative traffic of the two largest social news sites (digg, stumbleupon), the largest social bookmarking site (delicious), and a smaller, but growing, social news site (diigo) — which can barely be seen at the bottom axis.

furlcomparison.png

Resources

Social News sites

General news

  • Digg (about, tour, search): “Digg is a place for people to discover and share content from anywhere on the web… We’re here to promote that conversation and provide tools for our community to discuss the topics that they’re passionate about.”
  • Reddit (about, search): “reddit is a source for what's new and popular on the web — personalized for you. Your votes train a filter, so let reddit know what you liked and disliked, because you'll begin to be recommended links filtered to your tastes.”
  • StumbleUpon (about, video intro, guide, search): “StumbleUpon helps you discover and share great websites. As you click Stumble!, we deliver high-quality pages matched to your personal preferences… This helps you discover great content you probably wouldn't find using a search engine.”
  • Propeller (about, tour, search): “Propeller is a social news portal, meaning that it is programmed by you – the audience. Our members post links to stories from all over the Web… Once the link has been posted, you can vote on it, comment on it, share it with friends, or bookmark it to read later.”
  • Fark (about, help, search): “Fark.com, the Web site, is a news aggregator and an edited social networking news site… The idea was to have the word Fark come to symbolize news that is really Not News.”
  • Mixx (about, tour, search): “You find it; we'll Mixx it. Use YourMixx to tailor the content categories, tags, specific users and groups, and we'll deliver the top-rated content as chosen by you and people who share your passions. So go ahead and whip up your own version of the web. Just tell us how you like it Mixxed and we'll deliver the best the web has to offer”
  • NewsVine (welcome, help, search): “At Newsvine, you can read stories from established media organizations like the Associated Press and ESPN as well as individual contributors from all around the world. Placement of stories is determined by a multitude of factors including freshness, popularity, and reputation. Contribution is open to all, and editorial judgement is in the hands of the community.”

Technology focus

  • Slashdot (about, help, search): “News for nerds. Stuff that matters.”
  • DZone (about, search): “Fresh links for developers.”
  • For Search & Internet marketing professionals
  • Sphinn (about, help, search): “Sphinn is a social site for search and interactive marketers. It's designed to allow you to share and discover news stories, read and take part in discussions, discover events of interest and network with others.”

For researchers & scientists

  • Connotea (about, guide, search on homepage): “Saving references in Conntoea is quick and easy. You do it by saving a link to a web page for the reference, whether that be the PubMed entry, the publisher's PDF, or even an Amazon product page for a book. Connotea will, wherever possible, recognise the reference and automatically add in the bibliographic information for you.”
  • CiteULike (help, search on homepage): “CiteULike is a free service to help you to store, organise and share the scholarly papers you are reading. When you see a paper on the web that interests you, you can click one button and have it added to your personal library. CiteULike automatically extracts the citation details, so there's no need to type them in yourself.”

Social Bookmarking sites

  • Delicious (about, video tutorial, help, getting started, search): “Delicious is a social bookmarking service that allows you to tag, save, manage and share Web pages all in one place. With emphasis on the power of the community, Delicious greatly improves how people discover, remember and share on the Internet.”
  • Furl ([ about], help, video tour, uses of furl, search): “Furl is a free service that saves important items on the Web, allowing quick retrieval for future access. Furl archives a personal copy of every page and provides a search service the full text of all archived items. Each Furl member has a personal archive of 5 gigabytes (GB), large enough to store tens of thousands of searchable items.”
  • Diigo (about, video tour, tour, search): “Bookmark, highlight, and add sticky notes to any web page. Organize your bookmarks and annotations by tags or lists. Multiple ways to share your bookmarks and annotations.”
  • Magnolia (tutorial, search): “How is Ma.gnolia different from other social bookmarking services? It starts with making the social side of social bookmarking work better. With contacts, groups and different ways to share bookmarks both within and outside of Ma.gnolia, we make working together on a casual basis or more formal projects fun and easy.”
  • Simpy (about, help, search on homepage): “Simpy is a social bookmarking service. With Simpy, you can save, tag and search your own bookmarks and notes or browse and search other users' links and tags. You can be open and share your links with others, or keep them private. Simpy also helps you find like-minded people, discover new and interesting sites, publish your bookmarks, detect and eliminate link-rot, etc.”

General articles

  1. 30 largest social networking sites (October 2008)
  2. Social bookmarking sites
  3. Articles on folksonomies

Possible blog topics

  • Compare two of the social news sites.
  • Compare one of the social bookmarking sites to delicious.
  • Review Connotea or CiteULike from a student's perspective.

16 Image Search

by samooresamoore (29 Oct 2008 15:50)

10/29/2008

We discuss and explore the variety of image resources and search tools available.

Class structure

  1. Go through “At beginning of class” information
  2. Go through diagram explaining page monitors & RSS filters
  3. Work on exercises

At beginning of class

  1. No new grades.
  2. The exam is one week from Monday (i.e., November 10).
    • You should review the questions that have been posted so far, add to them as necessary, revise them as you see fit, and add pages for questions from days that don't have questions. (Do this by clicking on the link for the appropriate day.) Your motivation for this is that more questions that you have seen before might be on the exam. All of this posting of/revising questions must be done by midnight at the end of a week from this Friday.
    • The exam will be multiple choice, true/false, short answer, and essay.
    • You will be responsible for my lectures, for my in-class demonstrations, for your exercises, for student industry/company updates, and for anything that I have written on the Web site.
    • Post any questions you might have about the exam on the class forum under “Assignments.”
  3. Check the class notes page to see who is taking notes today and who is coming up with questions.
  4. Go over any other announcements you might have missed since last class.
  5. Check the start page for any blogs you might be interested in that you might have missed.
  6. There are four more opportunities to write a blog entry.
  7. Again: I have arranged the Yahoo visit for early December. Don't miss that class!
  8. Let's look over the upcoming industry updates.

My notes

  1. Diversity of image search tools
    • Search the Web
    • Search public, site-specific images
    • Search for-pay or for-free images
    • Search for facial images
    • Use images to find other images
    • Search news sites for images
  2. General search
    1. Google Images"Ann Arbor" example, only photos of Ann Arbor
      • Can search by size, filetype, color, domain (and description, of course)
      • Can limit your search results to newscontent, faces, or photo content (under advanced search)
    2. Ask Imagesexample
      • Can search by size, filetype, color
    3. Yahoo Imagesexample
      • Can search by size, color, domain
    4. Exalead Imagesexample
      • Be sure to hover a cursor over any image in the results page.
      • Under “advanced search”, can narrow your search by size, orientation, and color (search help)
  3. Image-focused search engines
    1. TinEye
    2. PicSearch (and advanced search page)
    3. Pixolu — search for [eiffel tower]
      • Searches Google, Flickr, and Yahoo
  4. Face search
    1. Exalead
    2. Picitup
    3. FaceSaerch
    4. Google Images
  5. Image-specific search engines for their private database of images
    1. Flickr
    2. Compfight: a flickr search tool
      • Search for [eiffel tower]
      • Notice difference in searching "Tags only" vs. "All text"
  6. Searching the news sites for images
    1. Images at Yahoo News — you can't specify to search for images but they appear at the top of the results page
      • This works as a direct link — just replace the “carbon+trading” with your own search term
    2. Images at Google News — again, you can't specify to search for images directly
      • This works as a direct link — just replace the ``detroit+lions with your own search term
  7. Stock photography (recommended by Presentation Zen)
  8. US Library of Congress images

Blog ideas

Resources

  1. Where to find free images and visuals...

17 Custom Search Engines

by samooresamoore (03 Nov 2008 17:10)

11/03/2008

We discuss custom search engines, and how you can build your own.

Class structure

  1. Go through “At beginning of class” information
  2. Explain what custom search engines are
  3. Work on exercises

At beginning of class

  1. No new grades
  2. Be sure to vote tomorrow.
  3. The exam is one week from today (i.e., November 10).
  4. Check the class notes page to see who is taking notes today and who is coming up with questions.
  5. Go over any other announcements you might have missed since last class.
  6. Check the start page for any blogs you might be interested in that you might have missed.
  7. There are three more opportunities to write a blog entry.
  8. Let's look over the upcoming industry updates.

My notes

  1. What is it
    • A search tool that (usually) uses Google's search engine (as the back end) but that you can instruct the following ways:
      • Look in a certain list of URLs
      • Avoid a certain list of URLs
      • Append a set of terms to any user-supplied query
  2. How can it be used
    • Its own Web page
    • An iGoogle widget
    • Embedded in random Web page
  3. Why would you use it
    • Captures creator's knowledge of the field

Blog topics

  1. Describe how useful or not Google Custom Search Engine is for your site.
  2. Describe how you chose the sites to include in your custom search engine.
  3. Compare and contrast 2 (or more) different custom search engines.

Resources

  1. Topicle
  2. Eurekster Swicki
  3. Rollyo
  4. Google Custom Search Engine
  5. BuildASearch
  6. Yahoo Search BOSS — Build your Own Search Service
  7. 3 Guides to FireFox Quick Searches (Smart Keywords)

18 Exploring Connections

by samooresamoore (07 Nov 2008 23:34)

11/05/2008

We discuss several tools that you can use to explore connections among Web sites and, more specifically, from other Web sites to yours.

Class structure

  1. Go through “At beginning of class” information
  2. Go through diagram explaining the tools we'll be looking at today
  3. Work on exercises

At beginning of class

  1. My wife had another MRI last night at 3:40am, this time on her spine. It went fine — just took a long time.
  2. No new grades
  3. The exam is next Monday (i.e., November 10).
  4. I have arranged for a speaker from Google to come to class on Monday, November 24. He is the same speaker I had last year, and he did a fantastic job. I'm very much looking forward to his day in class.
  5. Check the class notes page to see who is taking notes today and who is coming up with questions.
  6. Go over any other announcements you might have missed since last class.
  7. Check the start page for any blogs you might be interested in that you might have missed.
  8. There are two more opportunities to write a blog entry.
  9. Let's look over the upcoming industry updates.

My notes

What you might want to learn about a Web site

  1. Who has created a link to your site (Web site)
    • inlink — a link from another site to your site
  2. Who has written about your site (blog)
  3. Who has subscribed to your RSS feed
  4. How do people find your site (search engine)
connections.jpg

What can you learn about your Web site

Yahoo tools

The Yahoo SiteExplorer allows you to explore all the web pages indexed by Yahoo! Search. View the most popular pages from any site, dive into a comprehensive site map, and find pages that link to that site or any page.

  1. link:http://www.somedomain.com shows what sites link to this particular site
    • Yahoo search for 'link:http://autos.yahoo.com/'
      • 5.5 million pages on this site
      • 7.4 million links coming to this site (i.e. inlinks) — click on the “Inlinks” button
        • This lists all the pages that link to autos.yahoo.com
        • Choices of inlinks
          • From all pages, or from all pages except this domain (i.e., not from yahoo.com), or from all pages except this subdomain (i.e., not from autos.yahoo.com)
          • To only this specific URL or to any page within this entire site
        • Outcomes
          • Inlinks From All Pages to Entire Site: 14.15 million links
          • Inlinks from all pages Except from this subdomain autos.yahoo.com to Entire Site: 11.04 million inlinks
          • Inlinks From All Pages to Only this URL: 7.37 million inlinks
          • Inlinks from all pages Except from this subdomain autos.yahoo.com to Only this URL: 4.27 million inlinks
          • Inlinks from all pages Except from this domain yahoo.com to Entire Site: 460,649 inlinks
          • Inlinks from all pages Except from this domain yahoo.com to Only this URL: 51,580 inlinks
  2. Yahoo search for 'link:http://autos.yahoo.com/ -site:yahoo.com' — 18,600 documents
    • This lists all pages not in yahoo.com that link to autos.yahoo.com.
  3. Yahoo: linkdomain:somedomain shows what sites link to any page within this site (note that there's no http)
  4. Find a specific page in the database

Google search tools

  1. 'link:http://somedomain' shows what sites link to this site
  2. Find a specific page in the database

Google Analytics

Demonstrate Google Analytics.

What you can learn about your blog

  1. Search Bloglines
    • Search for citations of 'lifehacker.com'
  2. Search through Technorati
    • To search Technorati for posts that link to a page within a specific URL, go to the Technorati advanced search page and look for “URL Search”.
    • Example lifehacker.com
    • You can sort by “freshness” or by “authority”
  3. Use Feedburner (now owned by Google)

Resources

  1. Searching for a New President: Obama Wins on Search Engine Optimization
  2. Yahoo SiteExplorer
  3. Linkology — an article from New York Magazine describing how blogs relate to each other.

19 Test

by samooresamoore (10 Nov 2008 16:25)

11/10/2008

We take the test today.

  1. Sit every fourth seat.
  2. All material other than your pencil/pen should be along the walls of the classroom, away from your desk. Turn off your cell phone and all electronics; put them in your possessions along the walls. Put all hats and coats with your possessions as well.
  3. You have 1 hour, 10 minutes to complete the test. It's longer than I thought it was going to be but it shouldn't be a time problem for you to finish it.
  4. Read the front of the test when you get it but do not open it until I say.

20 Geography Based

by samooresamoore (17 Nov 2008 16:13)

11/12/2008

We discuss all types of geography-based search tools and resources.

Class structure

  1. Go through “At beginning of class” information
  2. Go through diagram explaining the tools we'll be looking at today
  3. Work on exercises

At beginning of class

  1. No new grades
    • I've graded first three pages of your tests.
    • To do
      • Tests
      • Projects
      • Blogs
  2. Rest of semester
    • No test
    • Some project related resources
    • Many general-interest resources
    • This is a test
  3. I have arranged for a speaker from Google to come to class on Monday, November 24.
  4. Check the class notes page to see who is taking notes today and who is coming up with questions.
  5. Go over any other announcements you might have missed since last class.
  6. Check the start page for any blogs you might be interested in that you might have missed.
  7. Let's look over the upcoming industry updates.

My notes

Here's what we're doing today:

  • Mostly just you exploring some amazingly cool and useful Web resources.
  • Please please add resources that you know about that aren't on this list that you find to be useful and interesting.

International, country-specific Web search engines

  1. SearchEngineColossus.com: "International Directory of Search Engines"
  2. Yahoo International: Yahoo home pages from countries around the world

Google Maps

  1. Google Maps (tour, popular content, featured content)
  2. Google Maps Mania: "An unofficial Google Maps blog tracking the websites, mashups and tools being influenced by Google Maps."
    1. 100 things to do with Google Maps mashups: the most fun you can have with maps.
    2. Also look under "Google Maps Links/Other Google Maps Stuff" in the right column.
  3. Pictures on Google Maps
  4. Related tool: Google Earth

Travel

  1. Google Sightseeing: "Google Sightseeing takes you on tour of the world as seen from satellite, using the free Google Earth program, or Google Maps in your web browser. Each weekday your guides James and Alex present new weird and wonderful sights as suggested by readers."
  2. TripBase: "Tell us what you like. We'll tell you where to travel." (review)
  3. TripWolf: worldwide travel guide (review)
  4. Whole Travel: "it's your planet. travel it. preserve it." (review)
  5. Standard travel search tools
    1. Expedia
    2. Orbitz
    3. Travelocity
  6. Newer general travel search tools
    1. Kayak: travel aggregator; searches 140+ travel sites with one search
    2. UpTake: "your first step on a great trip"; "search over 1000 travel websites and 20M opinions at once" (review, review)
    3. WeGo: "Wego searches through 100+ travel sites in the time that it takes you to search one. We’ll help you find the best prices and connect you to the best place to buy." (review)
  7. Specialized travel information
    1. NYC subway
    2. Google Transit: plan a trip using public transportation
    3. Ixigo: travel in India
    4. Sprice: "smart prices to go…anywhere" (actually, focuses on hotels in SE Asia, India, Europe) (review)
    5. HopStop: Provides door-to-door subway and bus directions and maps for NYC. Currently expanding to other major cities. Very popular in Manhattan.

Local search

  1. Ask City: search for businesses, movies, and events
  2. MSN City Guides
  3. Yelp: restaurant reviews
  4. BooRah: "restaurant reviews, menus, pictures, and more" (review)
  5. Mapquest Local: restaurants, events, news, weather
  6. When.com: "where to go, what to do, local events" (review)
  7. OnYoMo: "find movies, restaurants, airlines, services" (review)
  8. WebLocal: Canadian local search (review)

Road trips and driving

  1. Mapquest directions (search cheat sheet)
  2. Mapquest RouteBuilder
  3. Multimap driving directions
  4. Google Maps driving directions
  5. RoadsideAmerica Maps: "Find oddities and tourist attractions and plan trips more quickly"
  6. MileByMile: free road map RV itinerary guides
  7. Outside the U.S.
    1. Streetmap.uk: Great Britain street and road maps
    2. Australian driving directions
    3. European driving directions (ViaMichelin)
    4. Mappy.com: road guide for U.S. and Europe
    5. TheAA.com: route planner for U.K. and Europe

Maps

Entertainment

  1. Brewster Jennings Protects America: "Remember playing "Where in the World is Carmen Sandiego" as a kid? Well now the new game Brewster Jennings Protects America brings this classic adventure into the 21st century by merging the game play with Google maps technology*. In the web-based Brewster Jennings Protects America game you race around the globe as a government agent trying to stop a deadly terror attack from taking place…. "
  2. Google Moon Map

Information

  1. Historical Maps
  2. Maps of current interest
  3. Perry-Castaneda Library Map Collection (Univ of Texas)
  4. NationalAtlas Map Maker: build your own layered map with a wide variety of information
  5. WorldMapper: "a collection of world maps, where territories are re-sized on each map according to the subject of interest. There are now nearly 600 maps."
    • Animation: be sure to check out this animation
  6. World Sunlight Map: "Watch the sun rise and set all over the world on this real-time, computer-generated illustration of the earth's patterns of sunlight and darkness. The clouds are updated every 3 hours with current weather satellite imagery."
  7. EarthPulse (National Geographic): a visual guide to global trends
  8. EarthTools: "find places, latitude/longitude, sunrise, sunset, elevation, local time, and time zones"

Commerce

  1. AuctionMapper: search eBay for auctions (info)
  2. Oodle: buy and sell locally (classifieds)
  3. LiveDeal: online local marketplace

Real estate

  1. HousingMaps: a mashup of Google Maps and Craigslist
  2. Trulia: real estate search
  3. PropSmart: real estate search (and community)

Interactive tools

  1. GMap Pedometer: plan your walking trips and measure their length
  2. Wikimapia: a mashup of Google Maps and Wikipedia. Completely addicting to explore.
  3. MapMyRun: a tool to plot your running route and see what the distance was

Clocks

  1. The World Clock - Time Zones: Current local times around the world
  2. Greenwich Mean Time: use this to set your clock to the right time whereever you are
  3. World Time Zone: find the time using a map

Mobile tools

  1. Mapquest Wireless
  2. Google adds local to mobile web search

21 People Search

by samooresamoore (17 Nov 2008 17:50)

11/17/2008

We discuss several different ways to find out information about people through their Web presence.

Class structure

  1. Go through “At beginning of class” information
  2. Go through diagram explaining the tools we'll be looking at today
  3. Work on exercises

At beginning of class

At my request, we're going to have a visit from Nigel Melville to help explain his very exciting new class for BBAs that he's teaching next semester: BIT/MKT 378 “Service Innovation Management”. The following video relates to the MBA version of this class (which he taught last year).

  1. Grades
    • I graded the tests.
      • Possible points: 108
      • Range: -3 to -64
      • Median: -11.5
      • At the end of class I'm handing them out; we'll go over them; you'll give them back to me before you leave this classroom.
    • To do
      • Projects (today was status check #3); next is last day of class
      • Blogs
  2. The new building (as of Thursday) — it's really really sweet
  3. The speaker from Google will be here next Monday.
  4. Check the class notes page to see who is taking notes today.
  5. Let's look over the upcoming industry updates.

My notes

Using Google to find people

  1. Searching phone numbers (details)
    1. phonebook
    2. bphonebook
    3. rphonebook
  2. Searching names
    1. "full name" location
    2. "full name" company
  3. Can do the same thing at other general search engines: Yahoo, Ask, Cluuz

People search engines

  1. Pipl
  2. Wink (people search, phone number) Will not only look for the specified name but similar names (i.e. Ted Kennedy, Theodore Kennedy…)
  3. 411 Locate (people search, reverse phone, reverse address, ZIP code lookup, area code, international calling codes)
  4. ZabaSearch (advanced search, review)
    • Ages are wrong
  5. Switchboard (White pages, Reverse phone lookup, Public records search)
  6. WhoWhere (people search, phone lookup)
  7. WhitePages (people search, find email, reverse phone, reverse address)
  8. SuperPages (reverse phone, reverse address, area code, ZIP codes)
  9. AnyWho (people search, reverse phone)
  10. Yahoo People search (people search, reverse phone #, email search)
  11. ZoomInfo (advanced search)
  12. Spock (people search on the web, blogs, social networks)
  13. iSearch (review)
  14. 123people (people search) (review, review)
  15. Yasni (review): people search
    • Uses a tag cloud customized to the person

Social site search

  1. PeekYou (people search, username search)
  2. YoName (people search, email search, reverse phone, social networks, blogs; "focuses on social media search looking through LinkedIn, FaceBook, Yahoo 360, Flickr, Friendster, MySpace")
  3. LinkedIn (professional social networking search)
  4. MySpace
  5. Facebook
  6. The Internet Address Book

Specialized

  1. Military Search (veteran, current, military ancestors)
  2. AMA's Doctor Finder
  3. International
    1. InfoBel (search by country, six continents)
  4. YourFamily (finding ancestors and lost relatives)
  5. Family Search (online birth, marriage, death, census, church, and other indices; run by Mormon Church)
  6. Birthday Database
  7. Riya (image search for people, places, times, tags, descriptions) (advanced search)
  8. FaceSaerch (yes, that's the right spelling!) — search for faces
  9. Namepedia (review): the name database

Obituary search

  1. Social Security Death Index (ancestery.com) (1875-current, US Social Security numbers)
  2. Obituary Daily Times search
  3. National Obituary Archive (obituaries, memorials, funeral homes)
  4. Obituary Central (obituaries, cemetary searches)
  5. NY Times Obituaries

Resources

  1. Ten Ways you can find Phone Numbers on the Web (about.com)
  2. Top Ten Ways to do a free people search on the Web (about.com)
  3. Fifteen People Search Engines (about.com)
  4. How to find someone online (about.com)
  5. 4 people search engines: Looking for someone online
  6. iSearch
  7. 123people searches the social web
  8. Facesaerch: search for people's faces
  9. Namepedia (review): the name database
  10. Yasni (review): people search
  11. Google, Yahoo, Ask, Cluuz
page 1 of 212next »
Unless otherwise stated, the content of this page is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License