meowpunch/ApacheSparkWithScala

Apache Spark with Scala

Data

  • social network

    • fakefriends.csv: id, name, age, number of friends
  • e-commerce

    • customer-orders.csv: customerId, productId, priceId
  • superhero

    • Marvel-names.txt: heroId, heroName
    • Marvel-graph.txt: heroId, connectionId...
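
The column layouts listed above can be turned into typed records before any Spark job touches them. The following is a minimal sketch in plain Scala, not code taken from this repository: the case classes `Friend` and `HeroConnections`, the object `DataParsers`, and the sample lines in the usage note are hypothetical, and the whitespace delimiter for Marvel-graph.txt is an assumption.

```scala
// Hypothetical record types; field names follow the column lists above.
final case class Friend(id: Int, name: String, age: Int, numFriends: Int)
final case class HeroConnections(heroId: Int, connections: Seq[Int])

object DataParsers {
  // fakefriends.csv: id, name, age, number of friends (comma-separated).
  def parseFriend(line: String): Friend = {
    val fields = line.split(",").map(_.trim)
    Friend(fields(0).toInt, fields(1), fields(2).toInt, fields(3).toInt)
  }

  // Marvel-graph.txt: a heroId followed by its connection ids.
  // Assumption: ids are separated by whitespace.
  def parseConnections(line: String): HeroConnections = {
    val ids = line.trim.split("\\s+").map(_.toInt)
    HeroConnections(ids.head, ids.tail.toSeq)
  }
}
```

With an illustrative (made-up) line such as `"0,Will,33,385"`, `DataParsers.parseFriend` yields a `Friend` whose fields can be used directly, for example as the input of an `RDD.map` or a `Dataset[Friend]` conversion.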

Exercise

Predict Real Estate Values with Decision Tree Model

Predict real estate values per unit area based on several features.

  • features(X): HouseAge, DistanceToMRT, NumberConvenienceStores, Latitude, Longitude
  • label(y): PriceOfUnitArea
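
One way to wire these features and the label into a Spark ML decision tree is sketched below. This is an illustration under stated assumptions, not the repository's actual exercise code: the object name `RealEstateSketch`, the CSV path `data/realestate.csv`, the presence of a header row with exactly these column names, and the 80/20 split are all hypothetical choices.

```scala
import org.apache.spark.ml.evaluation.RegressionEvaluator
import org.apache.spark.ml.feature.VectorAssembler
import org.apache.spark.ml.regression.DecisionTreeRegressor
import org.apache.spark.sql.SparkSession

object RealEstateSketch {
  // Feature and label column names as listed above.
  val featureCols: Array[String] =
    Array("HouseAge", "DistanceToMRT", "NumberConvenienceStores", "Latitude", "Longitude")
  val labelCol = "PriceOfUnitArea"

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("RealEstateSketch")
      .master("local[*]")
      .getOrCreate()

    // Hypothetical path; assumes a header row and numeric columns.
    val raw = spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv("data/realestate.csv")

    // Spark ML expects all features packed into a single vector column.
    val assembled = new VectorAssembler()
      .setInputCols(featureCols)
      .setOutputCol("features")
      .transform(raw)
      .select(labelCol, "features")

    // Hold out 20% of the rows for evaluation (an arbitrary split).
    val Array(train, test) = assembled.randomSplit(Array(0.8, 0.2), seed = 42L)

    val model = new DecisionTreeRegressor()
      .setFeaturesCol("features")
      .setLabelCol(labelCol)
      .fit(train)

    val predictions = model.transform(test)
    val rmse = new RegressionEvaluator()
      .setLabelCol(labelCol)
      .setPredictionCol("prediction")
      .setMetricName("rmse")
      .evaluate(predictions)

    println(s"RMSE on held-out rows: $rmse")
    spark.stop()
  }
}
```

The sketch needs the `spark-sql` and `spark-mllib` artifacts on the classpath. Decision trees do not require feature scaling, which is why the assembler output is passed to the regressor directly.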

Realtime processing with Twitter Streaming data

Twitter API

CHANGELOG, 15 Nov 2021: Today, we are announcing that Twitter API v2 is now the primary version of the Twitter API. We have launched enough endpoints and functionality into Twitter API v2 to satisfy the needs of 90% of all existing Apps built on the Twitter API.

  • Sign up on the Twitter Developer site and get your keys
    • consumerKey: API Key
    • consumerSecret: API Secret
consumerKey 12345
consumerSecret 12345
accessToken 12345
accessTokenSecret 12345 
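
A credentials file in the `key value` layout shown above can be loaded at startup rather than hard-coding the keys. The sketch below assumes the streaming job uses the Twitter4J client, which reads OAuth settings from system properties named `twitter4j.oauth.*`; the object `TwitterCredentials`, its method names, and the file name `twitter.txt` in the usage note are hypothetical.

```scala
import scala.io.Source

object TwitterCredentials {
  // Parses "key value" lines into a map; blank or malformed lines are skipped.
  def parse(lines: Iterator[String]): Map[String, String] =
    lines
      .map(_.trim.split("\\s+"))
      .collect { case Array(key, value) => key -> value }
      .toMap

  // Reads the whole file, then closes it; the returned map is already materialized.
  def loadFromFile(path: String): Map[String, String] = {
    val source = Source.fromFile(path)
    try parse(source.getLines()) finally source.close()
  }

  // Assumption: Twitter4J picks up OAuth settings from "twitter4j.oauth.<key>".
  def applyAsTwitter4jProperties(credentials: Map[String, String]): Unit =
    credentials.foreach { case (key, value) =>
      System.setProperty(s"twitter4j.oauth.$key", value)
    }
}
```

A typical call would be `TwitterCredentials.applyAsTwitter4jProperties(TwitterCredentials.loadFromFile("twitter.txt"))` before the streaming context is created. Keeping the file out of version control avoids publishing real keys.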
