8-1 Data Warehousing and Data Mining (Full Version)

2025-02-04 18:10

… $1 - 1/k$) if each class has the same number of instances. Formula fragment from the same slide: $\sum_{i=1}^{k} p_i^2$

©Silberschatz, Korth and Sudarshan, Database System Concepts, 6th Edition

More Warehouse Design Issues
• Data cleansing
    ◦ E.g., correct mistakes in addresses (misspellings, zip code errors)
    ◦ Merge address lists from different sources and purge duplicates
• How to propagate updates
    ◦ Warehouse schema may be a (materialized) view of schema from data sources
• What data to summarize
    ◦ Raw data may be too large to store online
    ◦ Aggregate values (totals/subtotals) often suffice
    ◦ Queries on raw data can often be transformed by the query optimizer to use aggregate values

Chapter 20: Data Analysis

Warehouse Schemas
• Dimension values are usually encoded using small integers and mapped to full values via dimension tables
• Resultant schema is called a star schema
• More complicated schema structures
    ◦ Snowflake schema: multiple levels of dimension tables
    ◦ Constellation: multiple fact tables

Best Splits (Cont.)
• Another measure of purity is the entropy measure, which is defined as
  $$\text{entropy}(S) = -\sum_{i=1}^{k} p_i \log_2 p_i$$
• When a set $S$ is split into multiple sets $S_i$, $i = 1, 2, \ldots, r$, we can measure the purity of the resultant set of sets as:
  $$\text{purity}(S_1, S_2, \ldots, S_r) = \sum_{i=1}^{r} \frac{|S_i|}{|S|}\, \text{purity}(S_i)$$
• The information gain due to a particular split of $S$ into $S_i$, $i = 1, 2, \ldots, r$:
  $$\text{Information-gain}(S, \{S_1, S_2, \ldots, S_r\}) = \text{purity}(S) - \text{purity}(S_1, S_2, \ldots, S_r)$$

Other Types of Classifiers
• Neural net classifiers are studied in artificial intelligence and are not covered here
• Bayesian classifiers use Bayes' theorem, which says
  $$p(c_j \mid d) = \frac{p(d \mid c_j)\, p(c_j)}{p(d)}$$
  where
    ◦ $p(c_j \mid d)$ = probability of instance $d$ being in class $c_j$,
    ◦ $p(d \mid c_j)$ = probability of generating instance $d$ given class $c_j$,
    ◦ $p(c_j)$ = probability of occurrence of class $c_j$, and
    ◦ $p(d)$ = probability of instance $d$ occurring

Finding Association Rules
• We are generally only interested in association rules with reasonably high support (e.g., support of 2% or greater)
• Naï…

Other Types of Mining
• Text mining: application of data mining to textual documents
    ◦ cluster Web pages to find related pages
    ◦ cluster pages a user has visited to organize their visit history
    ◦ classify Web pages automatically into a Web directory
• Data visualization systems help users examine large volumes of data and detect patterns visually
    ◦ Can visually encode large amounts of information on a single screen
    ◦ Humans are very good at detecting visual patterns

End of Chapter

Finding Support
• Determine support of itemsets via a single pass on the set of transactions
    ◦ Large itemsets: sets with a high count at the end of the pass
• If memory is not enough to hold all counts for all itemsets, use multiple passes, considering only some itemsets in each pass.
• Optimization: once an itemset is eliminated because its count (support) is too small, none of its supersets needs to be considered.
• The a priori technique to find large itemsets:
    ◦ Pass 1: count support of all sets with just 1 item. Eliminate those items with low support
    ◦ Pass i: candidates: every set of i items such that all its i−1 item su…
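The entropy and information-gain definitions from the Best Splits slide can be tried on a small example. The sketch below is only an illustration: it assumes entropy is the purity measure plugged into the information-gain formula, and the function names (`entropy`, `split_purity`, `information_gain`) and the toy labels are made up for this example, not taken from the slides.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """entropy(S) = -sum_i p_i * log2(p_i), with p_i the fraction of instances in class i."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def split_purity(subsets):
    """purity(S_1..S_r) = sum_i |S_i|/|S| * purity(S_i), using entropy as the purity measure (assumption)."""
    total = sum(len(s) for s in subsets)
    return sum(len(s) / total * entropy(s) for s in subsets if s)


def information_gain(labels, subsets):
    """Information-gain(S, {S_1..S_r}) = purity(S) - purity(S_1..S_r)."""
    return entropy(labels) - split_purity(subsets)


# Hypothetical training labels and two candidate splits of the same set.
S = ["good", "good", "bad", "bad"]
perfect = [["good", "good"], ["bad", "bad"]]   # each subset holds a single class
useless = [["good", "bad"], ["good", "bad"]]   # each subset is as mixed as S

print(information_gain(S, perfect))  # 1.0
print(information_gain(S, useless))  # 0.0
```

A split that separates the classes completely recovers the full entropy of S as gain, while a split that leaves every subset as mixed as S gains nothing, which is why the split with the highest information gain is preferred.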

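The pass-by-pass counting described under Finding Support can be sketched as follows. This is a minimal illustration, not the book's implementation: the description of pass i is cut off in the text above, and the sketch assumes the usual a priori rule that a set of i items is a candidate only if all of its (i−1)-item subsets were found large. The function name `large_itemsets`, the `min_count` threshold, and the sample baskets are hypothetical.

```python
from itertools import combinations


def large_itemsets(transactions, min_count):
    """Return {itemset: count} for itemsets contained in at least min_count transactions."""
    transactions = [frozenset(t) for t in transactions]
    # Pass 1 candidates: every set with just one item.
    candidates = {frozenset([item]) for t in transactions for item in t}
    result = {}
    size = 1
    while candidates:
        # One pass over the transactions counts every candidate of the current size.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        large = {c: n for c, n in counts.items() if n >= min_count}
        result.update(large)
        size += 1
        # Assumed rule: keep a set of `size` items only if all of its
        # (size - 1)-item subsets survived the previous pass.
        pool = sorted({item for c in large for item in c})
        candidates = {
            frozenset(combo)
            for combo in combinations(pool, size)
            if all(frozenset(sub) in large for sub in combinations(combo, size - 1))
        }
    return result


baskets = [
    {"bread", "milk"},
    {"bread", "milk", "cereal"},
    {"bread", "cereal"},
    {"milk"},
]
for itemset, count in sorted(large_itemsets(baskets, 2).items(), key=lambda kv: sorted(kv[0])):
    print(sorted(itemset), count)
```

In this example {milk, cereal} appears in only one basket, so it is eliminated in pass 2 and the three-item set is never counted, which is the superset-pruning optimization the slide describes.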


