Found 27 repositories(showing 27)
yuan-more
2023最新最强大数据面试宝典,附答案解析,涵盖:一、Hadoop 二、Hive 三、Spark 四、Kafka 五、HBase 六、Flink 七、Clickhouse 十、数据湖 十一、必备SQL题 十二、必备算法 十三、大数据算法设计题
sahilchavhan
"Comprehensive notes on Big Data analysis, covering key concepts, tools, and techniques. Dive into the world of data analytics with this curated repository. Explore topics like Hadoop, Spark, and data visualization. Your go-to resource for mastering Big Data analytics. 🚀 #BigData #Analytics"
cordeirotelecom
No description available
heike
Midwest BigData Summer School: Website for R related topics https://heike.github.io/summerschool-2022/
viccanto
soc14lmed14 and c0d3 is an idea to bring the information to the client both at user or professional level, you want to develop an app for your business or a web site, you want security and storage in the cloud, you like social networks, ecomerce, studies sem seo, you like electric devices and you have a last generation phone, but you see programming a slab, you know everything that your phone has already installed and you squeeze it to the maximum, you have heard about digital currencies bitcoin pound, grow hacking and digital marketing, yes, but I already have a website, it is safe, or not, it gives me all the services that are easily implementable, digital signatures and ssl services, cloud storage, management app, my wifi is perfectly configured, or no, security is not for me, privacy and your rights to the data that you have at the mercy of anyone are not important, yes, ... we introduce tools that are revolutionizing the markets before the legacy of the bigdata but also soc14lmed14 on the other hand has its own themes and opinion, daily development and cordiality as much as possible with its themes and opinions, on the part of `n c0d3 brings you its informatics topics that you have at your disposal
hyu-leeky
A repository to share course materials in the KMU CS Hot Topics in BigData Processing
Zandex
No description available
TiagoAragao1
No description available
letdevx
No description available
Mateo-RH
Bitacora de bigdata para la materia Topicos de Telematica
MarcosCordeiro
No description available
giraldodiego
No description available
sakshi-mehta07
Analyzing Reddit’s Climate Change Discourse: A Distributed Study of Topic Modeling and Sentiment Analysis Techniques using GCP and PySpark
KelvinjArruda
Projeto de extensão
No description available
Thiagocsoaresbh
No description available
rafaelmarquesRM
No description available
abinashg2002-creator
No description available
hebaabbadi
No description available
AdrianoJesusDeveloper
Este projeto de extensão foi desenvolvido como parte de uma parceria entre a Faculdade Estácio e a Cooperativa Cocatrel. O objetivo é criar um **dashboard interativo** para a gestão de dados de ponto dos funcionários da cooperativa, permitindo uma análise mais eficiente e precisa das informações.
gabriel-anjos
Repositorio destinado ao projeto de extensão sobre topicos de BigData com ênfase nas funcionalidades da biblioteca pandas
sky1307
No description available
kaiogva
No description available
EmmaCojbasic
A student project done for the Big Data Systems course at the Faculty of Electronic Engineering
MIKUAFANS
[IEEE BigData 2025] SciTopic: Enhancing Topic Discovery in Scientific Literature through Advanced LLM
PallaviChavan07
presentations on following topics: 1. Introduction to Recommender System 2. BigData and Hadoop Technologies 3. Advanced SQL Injection in SQL Server Applications
The rise of different topics all over the interneton a daily basis is increasing rapidly which leads to a seriesof fluctuating data in all areas of the world and following upa particular topic and getting related information regardingthe topic is essential to keep with the trend. Most of thesetopics trend with their related hash tags on twitter on a dailybasis, and the data related to the trending topics are vastand require a capable and efficient framework to stream,analyse and cluster the topics based on the topic’s hash tag. Bigdata frameworks such as Apache Spark has high computingcapacity to manage such big data at a faster rate in an efficientway.The challenge of analyzing the trending topics on twitterfor real-time topic clustering to get a clear and only relatedtweets and information regarding a topic is the motivation forthe following applied clustering techniques applied using ApacheSpark and clustering algorithms. The clustering algorithm isconstructed using Spark LDA (Latent Dirichlet Allocation),the algorithm takes the live twitter stream data as an inputand the data is Represented using a vector space model,thenon-negative dimension weights highlight the significance of thein accordance term functions, one essential assets of the sortof function space is high dimensions which occurs.The LDAalgorithm takes the approximate assumed topics in the documentand will assign every word in the document to a temporarytopic using LDA which is a probabilistic model that posits aset of global topics and a set of document topics, the LDAprocess is applied iterative by loop each word in the document and update the topic assignment based on the criteria established.
All 27 repositories loaded