Welcome to the Invelos forums. Please read the forum rules before posting.

Read access to our public forums is open to everyone. To post messages, a free registration is required.

If you have an Invelos account, sign in to post.

    Invelos Forums->DVD Profiler: Desktop Feature Requests Page: 1  Previous   Next
Datamine common sense from the database
Author Message
DVD Profiler Desktop and Mobile Registrantlmoelleb
Beer Profiler now!
Registered: March 14, 2007
Denmark Posts: 630
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
As I guess most people are aware, there are constant discussions on how strict the database should be and how much "common sense" should (or can) be applied etc.

But with the following data available:
  • The "strict by the rules" master database

  • Uploaded collections with users local data


  • it might be possible for a standard database mining algorithm (as build into the database, not something you have to program) to calculate how likely it is I would want another value than the entry in the master database in a specific field. It might be more tricky with ordered "list data" like cast etc, but I would not be surprised if that could be done as well, the data mining algorithms appear to be pretty smart.

    At least it would take away the discussion if something is "common sense" or not - because if it is "common" the data mining will show it.
    Regards
    Lars
    DVD Profiler Unlimited RegistrantStar Contributorruineddaydreams
    Registered: Dec. 2, 2002
    Registered: March 14, 2007
    United States Posts: 1,339
    Posted:
    PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
    this is a pretty good idea...
    -JoN
    DVD Profiler Unlimited Registrantnolesrule
    Registered: 09/21/2000
    Registered: March 15, 2007
    United States Posts: 366
    Posted:
    PM this userEmail this userVisit this user's homepageView this user's DVD collectionDirect link to this postReply with quote
    The only thing useful for data mining is for invading privacy....err, I mean searching for terrorists.

    And before anyone flames me for the comment, I'm just kidding. It's just a joke. I think this is a pretty good idea. 
    DVD Profiler Desktop and Mobile RegistrantStar ContributorDJ Doena
    Registered: May 1, 2002
    Registered: March 14, 2007
    Reputation: Highest Rating
    Germany Posts: 6,747
    Posted:
    PM this userEmail this userVisit this user's homepageView this user's DVD collectionDirect link to this postReply with quote
    The following is strictly IMHO.

    I don't think it'll help.

    Why? I think the majority of the DVDP users does the following "Add DVD by UPC" -> OK.

    I think, about 75 up to 90 percent of all worldwide users never changed a profile after downloading it from the master database.

    The consequence is that all these user have locally exactly the same data as it is in the master database. Thus the value in the master database "is always right".

    I think you'll hardly find a field where more than 50 percent of the users have something different than what is in the master. And even if, I am even more sure that they will have as many different entries as there are users.
    Karsten
    DVD Collectors Online

    DVD Profiler Desktop and Mobile Registrantlmoelleb
    Beer Profiler now!
    Registered: March 14, 2007
    Denmark Posts: 630
    Posted:
    PM this userView this user's DVD collectionDirect link to this postReply with quote
    Karsten,


    It looks like you are thinking about check of which value is used by most people which relates to datamining algoritms the same way as counting on your fingers relates to super computers.

    Even if it was a "finger counting" algorithm it might work with the proper constant, which would obviously not be 50% but a significently lower number. No, I do not know what it would be, that is something that has to be testet, and most likely you would have to be able to specify some preferences to indicate if you favour "common sense" over "accuracy". Obviously the percentage of people "suppressing" a "common sense" value in their database could be taken into account as well.

    I suspect it might be problematic to run this on profiles very few people have uploaded, but then it should just be disabled until a sufficient amount has been reached.
    Regards
    Lars
        Invelos Forums->DVD Profiler: Desktop Feature Requests Page: 1  Previous   Next