Content uploaded by Ron Kohavi
Author content
All content in this area was uploaded by Ron Kohavi
Content may be subject to copyright.
SIGKDDExplorations. Volume2,Issue2–page106
WEBKDD2000-WebMiningforE-Commerce
MyraSpiliopoulou
Otto-von-GuerickeUniversitaet.
Universitaetsplatz2,D-39016
Magdeburg
myra@iti.cs.uni-magdeburg.de
JaideepSrivastava
UniversityofMinnesota&Yodlee,Inc
3600BridgeParkway,Suite200
RedwoodCity,CA94065
srivasta@cs.umn.edu
BrijM.Masand
RedwoodInvestmentSystems
76SummerStreet
Boston,MA02110-1225USA
brij@redwood.com
RonKohavi
BlueMartiniSoftware
2600CampusDrive,SanMateo,
California94403
ronnyk@bluemartini.com
ABSTRACT
Inthispaper,weprovideasummaryoftheWEBKDD2000
workshop,whosethemewas‘WebMiningforE-Commerce’.
ThisworkshopwasheldinconjunctionwiththeACMSIGKDD
InternationalConferenceonKnowledgeDiscoveryinDatabases
(KDD-2000).
Keywords
Webmining,e-commerce,personalization,clickstreamanalysis.
1. THEME
The ease and speed with which business transactions can be
carriedoutovertheWebhasbeenakeydrivingforceintherapid
growth of electronic commerce. In addition, customer
interactions, including personalized content, e-mail campaigns,
andonlinefeedbackprovidenewchannelsofcommunicationthat
werenotpreviouslyavailableorwhereveryinefficient.TheWeb
is revolutionizing the way businesses interact with each other
(B2B)and witheachcustomer(B2C).Ithasintroducedentirely
newwaysofdoingcommerce,includinge.g.auctionsandreverse
auctions. It also made it imperative for organizations and
companiestooptimizetheirelectronicbusiness.
Knowledge about the customer is fundamental for the
establishmentofviablee-commercesolutions.Webminingfore-
commerceistheapplicationofwebminingtechniquestoacquire
thisknowledgefore-commerce.Typicalconcernsine-commerce
includeimprovedcross-sells,up-sells,personalized ads,targeted
assortments,improvedconversionrates,andmeasurementsofthe
effectivenessofactions.
TheWEBKDD 2000 workshopis thesecondworkshop held in
conjunctionwiththeACMSIGKDDInternationalConferenceon
KnowledgeDiscoveryin Databases(KDD)anddedicatedtothe
challengesofwebmining.WEBKDD'99focusedontheaspectsof
web mining related to user profiling; the long version of its
proceedings has appeared as volume 1836 of the Lectures in
Artificial Intelligence series (LNAI) by Springer Verlag. In
response to call for papers, WEBKDD 2000 received 31
contributions. Each was reviewed by at least three program
committee members. Seven submissions were selected for
presentationaslong papers,andsixasshortpapersreportingon
goodideasataratherpreliminaryphase.
The URL http://robotics.stanford.edu/~ronnyk/WEBKDD2000
containsthe finalversionsof theworkshoppapersandtheslide
presentations.
2. WORKSHOP
The KDD community responded very enthusiastically to the
WEBKDD2000workshop,andwereceivedfarmorerequestsfor
attendance(approximately110)thanthere wasspace.About85
people attended the workshop, which brought together e-
commercepractitioners,toolvendorsanddataminingresearchers.
Thepaperpresentationwasdividedintothreesessions.
The first session, titled Web personalization and recommender
systems, focused on how web mining can address one of the
fundamental issues of B2C e-commerce, namely personalized
customerexperience. Asoftendescribed byJeffBezos, CEOof
Amazon.com, and mentioned by Joseph Pine in his The
Experience Economy [1], customer experience is the key to
building customer loyalty to an on-line store, since leaving the
storeisexactlyoneclickaway.Inthissessionwehadthreelong
andtwoshortpapers, whichpresentedtheleadingedgeideasin
this important area. Mobasher, Dai, Luo, Nakagawa, Sun, and
Wiltshire’spaper,titled Discovery ofAggregateUsage Profiles
forWebPersonalization,describedhowusagedatafromweblogs
canbeanalyzed/minedtobuilduserprofiles,andhowthesecould
be use to enhance the user’s browsing experience.Vucetic and
Obradovic’s paper, titled A Regression-Based Approach for
Scaling-UpPersonalizedRecommenderSystems inE-commerce,
presented an approach to applying regression techniques to
understand user preferences for recommender systems. This
approachisinterestingsincestatisticaltechniqueshavenotbeen
appliedsufficientlytothisproblem.Sarwar,Karypis,Konstanand
Riedl’spaper, titled Applicationof DimensionalityReductionin
Recommender Systems – A Case Study, presented a novel
applicationofdimensionalityreductiontechniquesfromscientific
computingtotherecommendationsystemproblem.Thetwoshort
papers in this session, namely Lin, Alvarez and Ruiz’s
Collaborative Recommendation via Adaptive Association Rule
MiningandChangandYuan’sASynthesizedLearningApproach
for Web-Based CRM, presented early results in alternative
approachestotherecommendationsystemproblem.Insummary,
the wealth of interest in applying various techniques to the
recommendation system problem shows the centrality of this
problemtoWebpersonalization.
SIGKDDExplorations. Volume2,Issue2–page107
Thesecond session,titledMining frameworksandcase studies,
presented experience reports from four groups on using web
mininginvariouse-commerceapplications.Anyfieldofenquiry
musthaveits‘proofofthepuddingisineatingit’component,and
thecasestudiespresentedhereprovideexactlythatflavor.Ansari,
Kohavi, Mason and Zheng’s paper, titled Integrating E-
Commerce and Data Mining: Architecture and Challenges,
providedacomprehensiveoverviewoftheissuesinapplyingdata
miningtechniquestoE-commerce.Theauthorsbringalotofreal
worldperspectivefromtheirexperienceatBlueMartiniSoftware,
especially from the viewpoint of an E-commerce solution
provider.TheusingerandHuber’sAnalyzingthefootstepsofyour
customers, A case study by ASK|net and SAS Institute GmbH
presentedtheexperienceofapplyingdatamininginE-commerce
from a solutions perspective, where SAS’s tools were used to
solve a problem for ASK|net. Sanford Gayle, in his paper The
MarriageofMarketBasketAnalysistoPredictiveModeling:The
EssentialChallenge inExploitingWeb-LogFilesforPrediction,
presented an approach to using association and correlation
analysis to extract predictive models from web logs. Coenen,
Swinnen, Vanhoof and Wets’ paper A Framework for Self
AdaptiveWebsites:TacticalversusStrategicChallengeexamined
the various issues in building such sites. The essential tension
seemstobebetweenmakingawebsitepersonalizedtoindividual
users-maybeevendynamicallychangeitbasedontheparticular
user’sbehavior–andtheinformationoverloaditcancause.
The third session, titled Navigation analysis, focused on how
clickstreamdata canbeanalyzedtoextractvaluablee-commerce
knowledge from it. Being able to analyze clickstream data
providesanunprecedentedopportunitytounderstandindetailthe
processleadinguptoabuy/notbuydecisionvs.justrecordingthe
finaloutcome-asisthecasewithpoint-of-saledata.Clickstream
data is over 95%+ of all data collected in most large-scale e-
commerce environments, and contains a wealth of knowledge
embeddedin it.Berendt’spaper, titledWebUsageMining, Site
Semantics, and the Support of Navigation, provided a general
overviewoftheissuesinclickstreamanalysis,andhowthemined
knowledge can be used for supporting site navigation. Kato,
Nakayamaand Yamane’spaper NavigationAnalysis Toolbased
on the Correlation between Content Distribution and Access
Patterns,presentedanapproachwherebyminedpatternsfromsite
contentcanbecorrelatedwithminedpatternsfromsiteusage,and
atoolbasedonthisapproach.Investigatingapplicationsofsucha
toolwouldbe aninteresting lineof inquiry.Gaul andSchmidt-
Thieme’s Mining Web Navigation Path Fragments presented
somenovelalgorithmsforextractingnavigationalpathfragments.
Finally,TanandKumar’sModeling ofWebRobotNavigational
Patterns addressed the challenging and commercially important
problemofseparatingthesitevisitsofwebrobotsfromhumans.
This is crucial for at least two applications: (1) as competitive
pressuresincrease,commercesiteswouldliketoblockrobotsthat
collectsensitiveinformation,and(2)accuratemodelingofhuman
users’e-commercebehaviorrequiresthatwebrobotaccessesbe
filteredout.Whilethepapersinthissessionpresentsomeofthe
leadingideas, theresearchinthisareaisjust beginningand we
havebarelyscratchedthesurface.
3. CONCLUSION
WEBKDD2000turnedouttobeaverysuccessfulworkshopby
all measures. More than 110 people showed interest in the
workshop and over 85 attended it. The quality of papers was
excellent,thediscussionwaslively,and anumberof interesting
directions of research were identified. This is a strong
endorsementofthelevelofinterestinthisrapidlyemergingfield
ofinquiry.
4. REFERENCES
[1] B. JosephPine,James H.Gilmore, B.JosephPineII. The
Experience Economy. Harvard Business School Pr; ISBN:
0875848192,
http://www.amazon.com/exec/obidos/ASIN/0875848192/ref
=sc_b_1/103-2009916-9046229