【正文】
s patterns, acceptable caching and consistency policies, security concerns, and fault tolerance requirements. To provide the desired performance, security, and reliability to a grid application, data provisioning needs to be optimized according to the application39。 本論文提出的數(shù)據(jù)管理服務(wù)來管理每個(gè)應(yīng)用程序 GVFS中會(huì)話,執(zhí)行獨(dú)立會(huì)話之間的隔離,并為每個(gè)會(huì)話申請(qǐng)所需的定制。然而,在大型系統(tǒng)中,許多動(dòng)態(tài)數(shù)據(jù)會(huì)話管理是另一個(gè)具有挑戰(zhàn)性的任務(wù),因?yàn)樗鼜?fù)雜性。因?yàn)橐粋€(gè)應(yīng)用程序定制的優(yōu)化(例如,有進(jìn)取心的預(yù)取文件內(nèi)容),可能會(huì)導(dǎo)致性能(例如,其他幾個(gè)稀疏文件退化,數(shù)據(jù)庫),應(yīng)用量身定做的功能通常沒有實(shí)現(xiàn)通用 O / S的內(nèi)核。由于眾所周知的 DFS的界面是由 GVFS中保存并提交給應(yīng)用程序,沒有修改要求他們的源代碼 ,庫或二進(jìn)制文件。 在網(wǎng)格式環(huán)境中的 DFS為基礎(chǔ)的辦法,它也被 成功在生產(chǎn)電網(wǎng)系統(tǒng)部署了數(shù)年,支持科學(xué)工具和許多學(xué)科的用戶。第二,寬領(lǐng)域,跨應(yīng)用程序域環(huán)境必要定制的優(yōu)化數(shù)據(jù)訪問,以解決效率低下(網(wǎng)絡(luò)延時(shí)長,有限的網(wǎng)絡(luò)帶寬) , 不安全(不安全的資源,有限的互不同域之間的信任),以及 不安 全(不可靠機(jī)器和網(wǎng)絡(luò))是在這種環(huán)境中的典型。 ?活力:在網(wǎng)格式的環(huán)境中部署的系統(tǒng)具有高度的活力。例如 科學(xué)網(wǎng)格 ,指 幾個(gè)機(jī)構(gòu)之間資源利用協(xié)調(diào)解決問題,企業(yè)信息系統(tǒng),從多個(gè)站點(diǎn)聚合協(xié)同努力發(fā)展。 ?規(guī)模:大量資源可以在網(wǎng)格式的環(huán)境中匯總 。特別是,該系統(tǒng)已作出以下貢獻(xiàn): ?它提供點(diǎn)播,跨域數(shù)據(jù)訪問透明的未修改應(yīng)用和 O /S用戶級(jí)別廣泛使用的 O / S 級(jí)的虛擬化基礎(chǔ)分布式文件系統(tǒng)( DFSs)。相反,數(shù)據(jù)網(wǎng)格管理需要專門處理這些獨(dú)特的問題。此外,用戶級(jí)別的增強(qiáng)為網(wǎng)格環(huán)境而設(shè)計(jì)風(fēng)格是建立在虛擬層,使數(shù)據(jù)配置與應(yīng)用程序所要求的特點(diǎn)。為了解決這個(gè)問題,用戶級(jí)別的 DFS支持自定義,則建議應(yīng)用定制的 GVFS中數(shù)據(jù)會(huì)話。動(dòng)態(tài)改變應(yīng)用程序的工作量和資源進(jìn)一步要求提供的數(shù)據(jù)會(huì)話連續(xù)監(jiān)測(cè),及時(shí)適應(yīng)它們的配置。為了進(jìn)一步降低管理數(shù)據(jù)會(huì)話的人為干預(yù),使他們迅速適應(yīng)不斷變化的環(huán)境,植物神經(jīng)功能內(nèi)置于數(shù)據(jù)管理服務(wù),使他們自動(dòng)監(jiān)測(cè),分析能力 , 優(yōu)化電網(wǎng)范圍內(nèi)的數(shù)據(jù)會(huì)分散的實(shí)體,合作共同努力實(shí)現(xiàn)所需的數(shù)據(jù)配置和資源使 用的目標(biāo)。s point of view, it is desired that the job execution is fast, secure, and reliable。 the applications also have diverse characteristics (., data access pattern) and needs (., desired data access performance, security, and reliability). Dynamism: Systems deployed in a gridstyle environment are highly dynamic. Failures on machines and works can happen at any time, and nondedicated resources may dynamically join and leave the system. On the other hand, applications are started and terminated on demand, and their workloads also vary over time. Scale: Large amounts of resources can be aggregated in a gridstyle environment. They are distributed across diferent institutions and connected on widearea works, providing the puting power and storage capacity to support executions of many applications. This dissertation focuses on two specific aspects of data management in distributed systems: data provisioning — providing applications running on the puting resources with remote access to their data stored on the storage resources, and the management of the data provisioning — the establishment, configuration, and termination of the remote data access. Computing in a gridstyle environment poses unique challenges to these tasks because of the above mentioned heterogeneous, dynamic, and largescale nature of applications and resources. First, the diversity of applications and resources motivates a data provisioning solution that can be transparently deployed, without modifying the existing operating systems (O/Ss) and changing the application source code or binaries. Second, the widearea, crossdomain environments necessitate applicationtailored optimizations for data access to address the inefciency (long work delay, limited work bandwidth), insecurity (insecure resources, limited mutualtrust between diferent domains), and unsafety (unreliable machines and works) that are typical in such environments. Last but not least, the management of data provisioning in a large, dynamic system also desires ?exible control and automatic optimization of the