Escolar Documentos
Profissional Documentos
Cultura Documentos
1. Introduction 1
1.1 Purpose 1
1.2 Scope 1
1.3 Definitions, Acronyms, and Abbreviations. 1
1.3.1 API’s 1
1.3.2 DFD 2
1.3.3 STD 2
1.3.4 MVC 2
1.3.5 OS 2
1.3.6 SQL 2
1.3.7 SVM 2
1.3.8 DB 2
1.3.9 Codeigniter 3.0 2
1.3.10 PHP 2
1.3.11 ER Diagram 2
1.4 References Error! Bookmark not defined.
1.5 Overview 3
3. Specific Requirements 6
3.1 External Interface Requirements Error! Bookmark not defined.
3.1.1 System Interfaces Error! Bookmark not defined.
3.1.2 Interfaces Error! Bookmark not defined.
3.1.3 Hardware Interfaces Error! Bookmark not defined.
3.1.4 Software Interfaces Error! Bookmark not defined.
3.1.5 Communications Interfaces Error! Bookmark not defined.
3.2 Functional Requirements Error! Bookmark not defined.
3.2.1 User Login Error! Bookmark not defined.
3.2.2 View Account Error! Bookmark not defined.
3.2.3 Delete Post or Account Error! Bookmark not defined.
3.2.4 Initialize Tweet Error! Bookmark not defined.
3.3.1 Use Case #1 Error! Bookmark not defined.
3.3.2 Use Case #2 Error! Bookmark not defined.
3.4 Classes / Objects Error! Bookmark not defined.
3.4.1 User Controller Error! Bookmark not defined.
3.4.2 Tweet Controller Error! Bookmark not defined.
Tweet Storage and Analysis
4. Analysis Models 14
4.1 Sequence Diagrams 14
4.1.1 Sequence diagram for “User Login” 15
4.1.2 Sequence diagram for “Tweet Search” 16
4.1.3 Sequence diagram for “Tweet Store” 17
4.1.4 Sequence diagram for “Analysis Tweet” 18
4.2 Data Flow Diagrams (DFD) 19
4.2.1 DFD Level 0 20
4.2.2 DFD Level 1 20
4.2.3 DFD Level 2 21
4.3 State-Transition Diagrams (STD) 22
5. Supporting Information 24
Appendix A – Background Research on: 24
Appendix B – Data Dictionary 24
Appendix C – Proposal 26
ii
Tweet Storage and Analysis
1. Introduction
Social life in most of our University is diminishing because of some factor. These limitations
of social life have affected some of the students not to know how to relate in the outside world. In
Caritas University socialism is a problem. In building a social network website for the University will
make the University to solve the problem of the anti socialism. A social network is a social
structure made up of individuals (or organizations) called "nodes," which are tied (connected) by one
or more specific types of interdependency, such as friendship, kinship, common interest, financial
exchange, dislike, or relationships of beliefs, knowledge or prestige.
1.1 Purpose
The main objectives of the study are Computer science Social Network while the subsidiary
objectives are:
1.2 Scope
The scope of this study is centered on designing and automated and web-based application
for student online communication and reduction of anti socialism. The system is
strictly based on students, management and lecturers communication and online
seminars and suggestion and specially for making of friends.
1.3.1 API’s
Application Programmer Interfaces
1.3.2 DFD
Data Flow Diagrams
1.3.3 STD
State-Transitions Diagrams
1.3.4 MVC
MVC is model view controller. It has three parts Model, view and controller. Model
relates to database and use to store data of users. View shows the actual output of the user.
Controller controls the working of all systems. Controller is an interaction between user and the
system.
1.3.5 OS
Operating System
1.3.6 SQL
Structured query Language. That is used to store data
1.3.7 SVM
Support Vector Machine
1.3.8 DB
Database
1.3.10 PHP
Hypertext Preprocessor (Personal Home page) created by Rasmus Lerdorf in 1995.
1.3.11 ER Diagram
Entity Relationship diagrams. It is to represent the data and also describe how these data
are related with one another.
1.5 Overview
The Laboratory management system is built in order to replace manual based system to computerize.
Here system is expected to be efficient, useful and affordable on implementing tasks that is order by the
Lab in charge.
The important purpose of this system is that we make an application that collect tweets of
user and analyze them and show the results in a way as per user requirements. When
analyze any tweet it can classify into groups such as positive, negative and etc. The
system has three important components 1) User Interfaces 2) Components of System that
the user used 3) System Modules.Modules keep tweets processing, how data gathered and
store them, how the tweet analyzes and show to user. Tweets fetched thorough the
Twitter API’s which create the interaction between user and the Twitter. When system
analyze the tweet it will return some information about tweet such as name of user who
post the tweet, time of tweet, location of tweet, how many people re tweet on this tweet.
Different people have different opinion and they post tweets in different manners.The
product also calculates results on that tweets.
The main purpose of the application is to fetched the tweets of user from Twitter store the
tweets in the databases and analyze the tweets. We need some API’s that help us to
extract data from twitter and need a framework that analyzes those tweets. We use this
tweets to examine the human behavior and problem of user. The application will show
that the tweet is related with some emotions and some other objects. Tweets are classified
in different types such as positive feelings, negative feelings and political types. Then the
application will easily be extract data from that classify tweets that the user need.
Twitter is a Networking site where everyone shares their opinions by posting tweets. User
may be varies from simple user to into different actors. Everyone share their thoughts in a
whole day by tweets. So the main function of this application is to analyze the all types of
tweets that get from posts and show the result.
Twitter is used by every user. A user who used twitter is not necessary that the user
always a regular user. User characteristics may be changed into different categories such
as user may be a doctor, may be a student, may be a politicians, may be an film actor,
may be a star and maybe he or she is a simple user that do nothing but use twitter. All
users has own characteristics, opinions and thoughts. A user perform different task. Users
poststhe tweets, users re tweet to other tweet, users may be favorite the tweets, user may
be check the timelines of other users and users may be follow to other users. System will
be available for every user there is no restriction on user.
(1) Tweet must be in English and there must a hash tag according to need.
(2) Reply or comment is included.
(3) Picture tweets are not included.
(4) Tweet will be accessible according to the limitation of API.
(5) Tweets without hash tag not included.
(6) To collect tweets there will be a limit of number of tweets that will be stored
according to storage and API.
(7) First of all we will register a user to access our application and session will
be created.
(8) Without registration nobody will access application.
(9) User will access tweets data according to granted permission by the admin.
(10)API allows only 6 to 9 days keeping history of data.
2. A twitter account
3. If the user use mobile then the twitter application must be installed in the mobile.
3. Frameworks need where we write code that controls all the execution of system.
Sentiment analysis of tweets is assumed that the all tweets are neutral. Because the
objective of this SRS is to check that the people posts tweets are relate to emotions that
are define in the limitations. If any user post tweet that not relate to any emotion and he
or she cannot express their feelings in tweet then we can assumed that the collecting of
tweets is may be neutral [6].
3. Specific Requirements
This will be the largest and most important section of the SRS. The customer
requirements will be embodied within Section 2, but this section will give the
requirements that are used to guide the project’s software design, implementation, and
testing.
Another view sees system analysis as a problem-solving technique that decomposes a system into its
component pieces for the purpose of the studying how well those component parts work and interact to
accomplish their purpose. Analysis and synthesis, as scientific methods, always go hand in hand; they
complement one another. Every synthesis builds upon the results of a preceding analysis, and every
analysis requires a subsequent synthesis in order to verify and correct its results. The field of system
analysis relates closely to requirements analysis or to operations research. It is also “an explicit formal
inquiry carried out to help a decision maker identify a better course of action and make a better decision
than she might otherwise have made.
System analysis is the study of sets of interacting entities, including computer systems analysis. This field
is closely related to requirements analysis or operations research. It is also “an explicit formal inquiry
carried out to help someone identify a better course of action and make a better decision than he might
otherwise have made. System Analysis is a methodology that involves the application of systematic
approaches to collects facts about an existing system with the aim of improving it or replacing it with more
efficient system within the context of the available resources. In other words, System analysis can also be
viewed as the process of investigating a system, identifying problems and using the information to
recommend improvements to the system.
Significant amount of time is allocated for writing the order as the pharmacist needs to go through the
stock balance and make rough estimate for the amount to order based on Figures.
The state of drugs in stock is manually checked.
Mistake of selling expired drugs to customers.
Too much workload on employees
Filing cabinet in the pharmacy with paper record.
Analysis of Proposed System
From the problems listed in the existing system, the implementation of the proposed system shall focus
on;
Preliminary investigation plays an important role in developing a satisfactory requirement. Its’ as a result
of thorough investigation of how the current or the existing system works using the facts gathered at the
preliminary investigation that leads to focusing on the possibility of replacing the existing system or
improving upon the existing system. This task involves information gathering.
Collecting and analyzing existing materials on the project topic, written by different expert.
Studying the present system in detail and the organizational style.
Knowing and understanding the input and output processes of the existing system.
Interviews: A qualitative form of interview was conducted in the pharmacy to know the equipment
needed, and the mode of operation of the old system.
Primary data: This source has to do with the text book contacted for the development of this project.
3.3 System Design
Systems design could be seen as the application of systems theory to product development. There is
some overlap with the disciplines of systems analysis, systems architecture and systems engineering
System design is the process of defining the architecture, components, modules, interfaces, and data for
a system to satisfy specified requirements through system modeling. One could see it as the application
of systems theory to produce development. The design of this system will be user friendly. It shall be
designed in such a way that employees will be able to navigate easily through the information supplied on
the system.
In other words, system design consists of design activities that produce system specifications satisfying
the functional requirements that were developed in the system analysis process. System design specifies
how the system will accomplish. System design is the structural implementation of the system analysis
ER diagram is our logical database requirements. It is used to represent the attributes and
classes of our system. It can also use to show relationships.
ER-Diagram
User
User ID User Email User Name User Password Gender
Tweet
Analysis
Analysis-Tweet
T-id ID ID
Database
3.7Design Constraints
Specify design constraints that can be imposed by other standards, hardware limitations,
etc.
Core language will be PHP but we will follow the structure of Codeigniter 3.0 in MVC.
First of all we will search tweets from twitter of an event, occasion etc. All the tweets
with hash tag will be available to search without hash tags no tweets will be searched.
Tweets with hash tag will be stored in database that later will be used for analysis.
4. Analysis Models
4.1 Sequence Diagrams
Sequence Diagram shows how events move from object to object to object. How events
cause flow from one object to another in the time period. Sequence diagram is a short
representation of use case activities.Once a complete sequence diagram has been
developed, all of the events that cause transitions between system objects can be collated
into a set of input eventsand output events (from an object). This information is useful in
the creation of aneffective design for the system to be built. Different symbols used for
this purpose.
Works as an actor
Synchronization of message
Reply of message
It is used for to store data and tables names that uses in database.
Diagram #2
Initial state
Final state
Decision symbol
Synchronization state
Transition symbol
5. Supporting Information
DFD stands for Data Flow Diagram. DFD is a graphical representation of flow of data
using some components and objects. It can also use for the visualization of data. A DFD
describe the information of the data where the data comes from, where will go and where
will be store.
DB stands for Database. Database is the major components of any system or software
because it used for store data.Database contains tables where data store in the form of
rows and columns.
Framework is a structure indicates that how program built and how the components of
program interact with one another.Framework includes actual coding of program and
interface of programming. It is also called Skelton of the system.
Git-Hub is a web-based software system that is to store online coding of our application.
We can access of our whole application code just by use them software.
Hash tag is a word that is defines with a hash symbol (#). It is used on social sites such
as Twitter to detect a message and post on a specific topic.
Codeigniter 3.0 is PHP framework that is used for develop the web applications by
follow the MVC (model view controller) structure. Codeigniter source code hosts by Git-
Hub.
MVC stands for Model View Controller. It is containing three parts model, view and
controller. Model is used to relate database. View is used to show the output. Controller
is used to control the working of the system and is also used to show the interaction
between the model and view.
PHP stands by Hypertext Preprocessor. PHP is a most popular language that is mostly
used for web development applications.
SQL is structured Query Language. SQL is used for communication with the databases
through SQL queries. It is a language that is actually used for relational database
management system. We use different type of queries to extract data from database.
SRS stands for Software Requirement Specification. It is written document of system that
can we developed. It is include functional and non-functional requirements about the
system. It is also include some diagrams that show the interaction of user and system.
STD is State Transition Diagram. It is consisting number of states called finite states
diagrams. It is a dynamic flow of system state by state.
SVM stands for Support Vector Machine. It is used to analyze the data by using some
algorithms.
Use Case is a list of events or actions and interactions between the user and the system. It
is contain many use cases that the user performs in some different environment.
Appendix C – Proposal
Abstract
Surveys are very popular for collecting data but it is very costly. So we suggest social media
analysis using different social websites. For example: Facebook, Twitter, Gmail, etc. In my
research work I will use Twitter. In which firstly collect the ‘‘tweets’’ and analyze Tweets in
different manners. When any user post the tweet on Twitter then store the tweets of user and
analyze it for checking that the tweet relate with happy mood, sad mood, politics and relate to
some other event. For this purpose we will use different tools and framework relate to this
analysis. We also use Database to store our tweets. Tweets are used for information from
different sources and also used to share experiences and opinions on different topics. By posting
tweets on Twitter millions of users share own opinions on different aspects of life every day.
People tweets according to their own thoughts what he wants to share and what he can feel?
Twitter is most popular website to share opinions and thoughts seconds by seconds. We show
how to collect the tweets and how to analyze that tweet and show the result about that tweet and
using different tools and framework we perform better than previously proposed methods. This
research explores the use of social media website Twitter to examine when user tweet or Re
tweet then we analyze where the user from, which country and what’s the location where the
user use the social media.
1. Introduction
Twitter is a social service that has rapidly become the social media service that is mostly used in
strategic communication campaigns [1]. Twitter is a network that enables the users to share
information or share their thoughts using private and public messages and networks. User tweets,
Re tweet, favorite, or follow by using their cell phones and their laptops and by other device as a
text message. Ability to communicate is real-time with short messages every time in a day.
Social media, including Twitter, are said that it can be used for collect information about any
occasion and about any incident like earthquake and communication among family and friends.
Consequently, users activity logs on Twitter exist in the form of tweets. Therefore it is easily to
understand to observe that what happened with the user by analyzing his or her tweets. We also
detect that what happened with the user and what happened with the network by analyzing
tweets. With the help of Tweets we analyze what’s the problem in that country for that reason the
network problem exists. Tweets can be used to examine the human behavior.
To post tweets, we use three types of media: PCs, mobile phones, and smartphones. Social
amplification of risk describes that psychological, social, and cultural factors interact with
emergency events or attenuate risk perceptions [2]. New methods such as data mining, aggregate
the data, and analyze the data from social sites in real-time are available [3]. Twitter
(www.twitter.com) is very popular and suitable for data storage and analysis the data of the users
who use Twitter and post tweets according to their own requirements. Users update their status
by tweets and share with followers and friends on different emotions (e.g. thoughts, feelings,
activities, and opinions).
Social websites are very popular among Internet users. Millions of messages and posts of million
users are collect daily in social web-sites such as Twitter [4], Tumbler [5], and Facebook
[6].Users write about their own life, share thoughts, share own experiences on different topics
and discuss current issues about media, marketing and country issues. Users also post about their
products that they will use and share jobs experiences. A user can express their political and
religious views. We collect all those data that used for marketing and social analysis thatshow
the behavior of the user that he or she adopts in the whole day.
We use a dataset formed of collected messages from Twitter. Tweets contain a short text or long
text created by the user. Table 1 shows the examples of tweets post on Twitter by user.
saaan: @Chicago Hopefully the day of games are best day for Chicago.
brookly: I love the sound my mobile when I unlock my phone and enter the password. Tick
Tuck Tock.
john: I like the way when Google change their appearance in every day it was very decent.
Twitter’s users change from regular users to celebrities, company representatives, politicians [7],
and even country presidents. Therefore, the data that will collect relate to different social and
different interests groups. Twitter’s user may be represented from many countries [8]. We
collected text from posts on Twitter in three types of texts:
1. Posts of data which contain positive feelings, such as happiness, love or joy.
2. Literature Review
Researchers are continue to show a strategies used by companies to develop relation with users,
workers and developers. Strategies used to examine the nature and behavior of the messages that
the users communicate with their friends and followers on social websites. While [9] dialogic
principles and [10] relationship strategies discuss open. When social media analysis is not
available then we don’t know the behavior of the user and not know what they said in
conversations.
With the help of social networks, opinion and data mining analysis became a very interesting
field for many users. They can easily analyze the data of the user and understand the behavior of
the user and understand what the meaning of the user conversation. A very broad overview of the
existing work was presented in [11]. In their survey, the users explain the existing techniques and
approaches for the tweet storage and analysis. They describe their opinions on that information
that they collect from different posts on Twitter. However, for this purpose we use many Twitter
API’s that help us to collect data corpus of text posts and convert it into three types of classes.
These classes help us to analyze our data in a better and efficient way. Three classes are: positive
emotions, negative emotions, and a set of texts that has no emotions (relate some other).
To collect the positive and negative classes, we followed the same method that used in [12], [13].
Other positive sets are many, much, more and best. Negative sets are the opposite of the positive
sets. Negative sets show the loss or disappointment of user. Such words used mostly to express
negative feelings like: “missed”, “bored”, “gone”, “lost”, etc.
As [14] research on the usage of social media cover a range of social media sites including social
networking sites, photo editing and education. We define the most frequent users who use
Twitter and post frequently many tweets per day. We also collect the data of those users who
post tweets per day in a certain manner. We choose frequent Twitter usersfrom all users we
crawled. We extract tweets of frequent Twitter users and store into dataset assemble the tweets in
an efficient manner so that we know easily which tweet relate to which emotion and then analyze
the tweets according to requirement. We count the tweets by the manner of time as per hour;
minute and seconds and normalize these tweets for analysis.
3. Problem Statement
Our problem statement is very simple. We want to analyze the tweets on the bases of hash tag
different types of analysis will be done later on.
(c) Showing that the tweet is in happy manner, sad manner or relate to politics and
relate to some other event.
4. Research Methodology
To attain our goal we will use Codeigniter Framework. Codeigniter Framework use MVC
structure. To understand the Codeigniter framework working first we will understand what MVC
is:
The Model-View-Controller (MVC) is divided into three parts: model, view, or controller. These
steps define not only the roles of application but also define the communication of objects with
each other. MVC has define the predefined path for communication and define how control
move from page to page. It describes the path or routes for users. Routes define that how the user
request processed and view the required information to the user. MVC has three steps:
Step 1: Model
The model manages the data through the database. All information and data that required by the
user or user request to controller the model provide to controller and controller process the
information and view them. The main role of model is describe the relation of data that one-to-
many, many-to-one and many-to-many.
Step 2: View
The view shows the appropriate message, output and information to user. When a user request it
will receive the controller and then send to model to get data that the user wants. Model returns
the data to controller and controller proceed into actual output and view to the user.
Step 3: Controller
Each and every thing controlled by the controller. It manipulates and controls the user requests
and also converts that requests into actual output.
We will also use Codeigniter Framework the one of the top framework used these days. There
are different versions of Codeigniter available but we will use Codeigniter 3.0 and to access the
tweeter API and for authentication we will use socialite and j7mbo these do will do
authentication and API access work for our application and these are most common and widely
used authentication tools. J7mbo is a tool that uses to access the API’s.
5- References
[1] Stelzner, M.A. (2009). Social media marketing industry report. Retrieved online February
26, 2010 from http://www.socialmediasummit09.com/.
[2] Kasperson R, Renn O, Slovic P, Brown H, Emel J, et al. (1988) The socialamplification
of risk: a conceptual framework. Risk Analysis 8: 177–187..
[3] Eysenbach G (2006) Infodemiology: Tracking flu-related searches on the web for
syndromic surveillance. AMIA 2006 Symposium Proceedings. pp 244–248.
[4] http://twitter.com
[5] http://tumblr.com.
[6] http://facebook.com
[7] http://www.sysomos.com/insidetwitter/politics
[8] http://www.sysomos.com/insidetwitter/#countries
[9] Kent, M. L., & Taylor, M. (1998). Building dialogic relationships through the World
Wide Web. Public Relations Review, 24(3), 321–324.
[10] Hon, L. C., & Grunig, J. E. (1999). Guidelines for measuring relationships in public
relations. Gainesville, FL: Institute for Public Relations.
[11] Bo Pang and Lillian Lee. 2008. Opinion mining and sentiment analysis. Found. Trends
Inf. Retr., 2(1-2):1–135.
[12] Jonathon Read. 2005. Using emoticons to reduce dependencyin machine learning
techniques for sentiment classification.In ACL. The Association for Computer
Linguistics.
[13] Alec Go, Lei Huang, and Richa Bhayani. 2009. Twittersentiment analysis. Final Projects
from CS224N forSpring 2008/2009 at The Stanford Natural LanguageProcessing Group.
[14] Kireyev, K., Palen, L. & Anderson, K. (2009) ‘Applications of Topics Models toAnalysis
of Disaster-Related Twitter Data’, , Whistler, Canada, 12 November 2009.